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A PIEBALD FAMILY. 


By E. A. COCKAYNE, M.D., M.R.C.P. 


In spite of the great interest, which they have always excited, well 
authenticated examples of piebalds in the dark races have been found to be rare. 
In the white races they are much less conspicuous, in part owing to the presence 
of clothing and in part owing to the lack of contrast between the pigmented 
and unpigmented skin, but the likelihood of their coming under the notice of 
a skilled observer is much greater. The scarcity of records shows that piebalds 
in the white races also must be very uncommon. Last year I met with a case 
in a baby, and found that the child belonged to a family, many of whose members 
showed a similar defect of pigmentation. The family, belonging to a farming 
stock, originally came from the neighbourhood of Bury St Edmunds in Suffolk, and 
the anomaly is known to have descended directly through six generations. The 
oldest member, with whom I have talked, is fairly certain that it was present in 
one generation at least before this. 

Of the first two generations in the pedigree (see Plate XI), I could obtain no 
definite information except the statement as to the existence of the piebaldism in 
I. 1 and II. 1, but of the third, ITI. 2 is said to have had a frontal blaze of white 
hair and white skin on the neck and forearms, which was very conspicuous owing 
to its marked contrast with the neighbouring weather-stained normal skin. 

III. 4 appears to have been the only member of the family who showed a 
marked dislike to the condition and always wore a wig to hide the frontal blaze. 
III. 2, whose family name was C——*, had fifteen children. The first, IV. 2, 
a male, with dark hair, married twice, and had eight normal children, five by the 
first wife and three by the second. The second child, IV. 5, was a piebald, with a 
large frontal blaze, white skin on the front of the neck and arms, and blue eyes. 
He transmitted the condition to all his three children. V. 3, the eldest boy, 
aged 22 and unmarried, possesses dark hair, with a V-shaped frontal blaze of 
white or cream coloured hair, the apex of the V commencing near the coronal 
suture and spreading out to a width of 3} inches, as it reaches the forehead. The 
eyebrows and some of the eyelashes are white. The next boy, V. 4, is aged 18. 
He has light hair and a very large blaze of unpigmented hair, which covers the 
whole of the top of the head. His eyebrows and eyelashes are white, and the eyes 
are blue. Both boys have white patches on the front of the neck and on the arms 


(see Plate XII). 


* Names preserved in the confidential register of the Galton Laboratory. 
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Next in the fourth generation were twins, IV. 6 and 7, both piebalds. They 
were evidently not uniovular, because one had dark hair, and one light, and the 
white blazes were dissimilar in extent, but it is uncertain which had the larger. 
Both died at an early age. 


The next, IV. 8, a girl, was normal with dark hair and eyes and remained 
unmarried. Next came a woman, IV. 10, who was a piebald with a large frontal 
blaze, white eyebrows and eyelashes, and white skin on the front of the neck and 
forearms. The right eye was blue, and the left brown (see Plate XIII (B)). Her 
child, aged 13, is quite normal with light hair and dark eyes. 


The next child, IV. 12, Mrs W——, has a large frontal blaze and dark brown 
irides. There is a large irregular patch of white skin extending from just below 
the chin to the heads of the clavicles, and round it the skin appears to be more 
deeply pigmented than the rest of the skin of the neck. There are a few small 
islands of pigmented skin near the edge of the unpigmented area. The skin of 
the anterior aspect of the forearms is unpigmented from the elbows to the wrists, 
and here also, there are some small islands of pigmented skin in marked contrast 
to the unpigmented area, in which they lie (see Plate XIV). 

The first two children of this individual were daughters, V. 8 and V. 9, both 
piebald, the third a normal son, V. 10, and then three more piebald daughters, 
V.11—13. The first of the daughters, Mrs G , V. 8 (see Plate XIV), is very 
fair with a very large frontal blaze covering the whole of the top of the head, 
and her eyebrows and eyelashes are white. Her normal] hair has pale creamy 
diffused pigment and, according to the individual hair, some to a decided number 
of granules*. The hair of the blaze has no diffused pigment and no granules. 
The irides are light brown, but the outer segments on both sides are paler and 
greenish in colour. The skin of the forehead and base of the nose is very pale in 
colour. She has a large white patch on the skin of the front of the neck, beginning 
just below the chin and widening out so as to embrace that over the inner ends 
of both clavicles. As in her mother there appears to be some concentration of 
pigment round this white area, and there are small isolated areas of pigmented skin 
near its edge. She has unpigmented skin on the anterior aspect of both forearms. 

Of her two children the first, VI. 1, a boy aged 3, is normal, the second, VI. 2, 
a boy aged 14, is a piebald (see Plate XIV). This child, VI. 2, was nine months 
old when first seen. He had a very large frontal blaze, resembling that of his 
mother and covering all the top of the head, the eyebrows and eyelashes were 
white with the exception of some of the outer hairs. Hair, pale cream in colour, 
said to be from the light area, has pale creamy diffused pigment and some granules 
(8), the granules being very small. It was obvious, even at this age, that hetero- 
chromia iridis was present. The right iris was pale except for a sector of dark 
grey occupying the upper and outer quadrant, the left iris was entirely dark grey. 
No difference in colour of the skin of the neck or forearm could be made out. 








* B to y on the Galton Laboratory scale of granular pigmentation, 
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When the baby was seen after the summer of 1913, the grey portions of the irides 
were becoming brown, the pale portion was still light blue. The face and arms were 
sunburnt, and it was noticed that the forehead was paler than the rest of the face. 
There was a pale area on the front of the neck, and the whole anterior surfaces of 
the forearms were white, the edges being very irregular in contour. There was 
also a white streak running obliquely right across the posterior or extensor aspect 
of the left forearm, and this offered a marked contrast with the rest of the surface, 
which was very brown. When the sunburn had died away the difference between 
the pigmented and unpigmented skin could no longer be made out. 

IV. 12’s second daughter, V. 9, aged 23, has only a small cream coloured 
frontal blaze, and the rest of her hair is light brown (see Plates XV aad XVI). 
The eyebrows are composed of an even mixture of brown and white hairs, and 
the eyelashes are similar, with brown and white hairs alternating. The irides are 
grey and uniformly pigmented. There is a large irregular area of white skin at 
the base of the neck. 

The whole of the anterior aspect of the right forearm is unpigmented, and 
there are similar small areas scattered over the posterior aspect (see Plate X VII). 
The left forearm is white only on the anterior aspect. 

The next girl, V. 11, is aged 9. She has a very small frontal blaze, but the 
skin of the forehead is pale (see Plate XV). The eyebrows show a division into 
two parts, on the inner halves grow white hairs only, and on the outer brown hairs. 
The eyelashes on the contrary consist of alternate brown and white hairs. The 
irides are grey and uniformly coloured (see Plate XVIII). There is only a small 
white area in the middle of the front of the neck, but there are well differentiated 
white areas on the anterior aspects of both forearms (see Plate XIII (A)), and on 
the inner aspects of both upper arms. Her hair was examined and the first sample 
showed very pale diffused pigment and some granules (8). Two more samples 
were then examined, one from the blaze and one from the neighbouring part of 
the scalp. The first showed no diffused pigment and no granules, the second 
showed the majority of hairs with yellow-brown diffused pigment and a decided 
number to plenty of small granules (y—6), but a few had no diffused pigment 
and no granules. 

The next piebald child, V. 12, died young. She had a frontal blaze and blue 
irides. Some of her hair showed very pale diffused pigment, and some granules (8). 

The next child, V. 13, also died young. She was a piebald nearer to the 
classical type than any of the others. She had a large frontal blaze, white skin 
on the forehead, and large areas of white skin on the front of the neck and 
chest, and in addition a very extensive area on the abdomen. 

Of the fourth generation the next child, IV. 13, was a male with dark hair and 
eyes, who had 5 normal children; the next, IV. 15, had fair hair and died young. 
Twins, IV. 16 and 17, came next and died in infancy*. They were heterogeneous, 

* The tendency to twin in this family is worth noting. 


26—2 








200 A Piebald Family 


a dark-haired boy and a light-haired girl. A girl, IV. 19, was born next and she 
had twin sons, V. 15 and 16, who were also normal. The last three children, 
IV. 20—22, a girl, a boy, and one whose sex I am unable to ascertain, were all 
normally pigmented and all died at a very early age. 

The pedigree confirms the strongly hereditary nature of piebaldism, and in this 
as in other published cases the character can affect cither sex, but has only been 
transmitted by those affected. Unless we are to assume that in the case of such 
a rare anomaly as piebaldism, I. 2, II. 2 or III. 1, were really unnoticed piebalds, 
then III. 2 could only be heterozygous, or since piebaldism is dominant a (D&). 
We must take IV. 4, IV. 9, IV. 11 and V.7 for pure recessives (RR). Thus the 
number of piebalds in the five sibships of generations IV, V and VI should be one 
quarter, ie. $(15+3+4+1+6+2)=7 nearly. We have actually 14 out of 27, 
thus piebaldism does not seem to act numerically as a pure dominant. 

The areas of unpigmented skin are less than in the classical piebalds, but it is 
probable that in some, at least, they are larger and more numerous than I have 
stated. On the covered parts of the body and legs, which I was unable to 
examine except in the baby, they would not be very noticeable. It was not until 
I had noticed the white skin on the neck and arms of one of them that I was told 
anything about the existence of similar patches on the others. If true, it is 
remarkable that none have had white patches on the legs. 

With regard to the local distribution of the pigment, there appears to be an 
excess at the edges of some of the unpigmented areas, as has been noted in other 
cases. In the case of other pale areas, the demarcation between them and the 
normal skin is very slight, and is probably due to the fact that they are not wholly 
unpigmented. ‘This remark applies especially to the forehead, which in some of 
them looks paler than natural, but not wholly devoid of pigment. 

In some the eyelashes are alternately white and brown, and in others the eye- 
brows are similar, and in one at least hairs growing on the scalp near the blaze 
are in some instances entirely without either diffused or granular pigment. This 
suggests that the skin beneath may show a deficient and irregular distribution of 
pigment. 

The most interesting feature is the occurrence in three members of the family 
of well-marked heterochromia iridis, a character which has been met with in 
members of a piebald family, but always independently of their piebaldism, never, 
as in this case, in true association with it. It proves conclusively that these cases 
are not congenital leucoderma. 

There seems to be no association of piebaldism and general lack of pigmenta- 
tion of hair and irides. Affected and unaffected members have been both fair and 
dark, but the fairest piebalds seem to have the most extensive frontal blazes. 

In the cases photographed the individuals were blonds and there has been great 
difficulty in getting a good photographic contrast of differences of pigmentation 
very noticeable in the living subject. 
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V. 3 and V. 4 as children showing their marked V-shaped frontal blazes. 
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The blaze may be seen with difficulty under a lens, 


but the heterochromia iridis is easily distinguishable. 
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V. 9. Showing white forelock or blaze, 
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Right forearm of V. 9 showing white patches on posterior aspect. The photograph is untouched and it is difficult 
to bring out by photography the grades of pigmentation when the arm is untanned by the sun, although they 
are quite clear on actual inspection. 
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Large photograph of V. 11 to show paleness of forehead and white hairs on inner half of eyebrows. 











CLYPEAL MARKINGS OF QUEENS, DRONES AND 
WORKERS OF VESPA VULGARIS. 


By OSWALD H. LATTER, M.A. 


Upon the front of the head of Vespa vulgaris certain yellow markings stand 
out conspicuously upon the otherwise black surface. Below the three ocelli 
and between the upper portions of the two compound eyes there is a median 
four-sided yellow patch, the “corona”; to the right and left of this, separated 
from it by a fairly wide interval, and occupying the bay of each of the compound 
eyes is a pair of elongated yellow blotches; while straight below the corona and 
between the lower portions of the compound eyes is a very conspicuous yellow 
area which extends over the clypeus and down to the labrum or upper lip which 
lies between the two mandibles. This ciypeal patch of yellow bears upon it a 
black mark which is subject to considerable variation. I distinguish in the queens 
and workers five chief types of this black mark: see diagrams on p. 202. In 
Type I a broad vertical black band extends right through the yellow patch from 
the top to the bottom; a little below its middle the band bears to right and left 
a pair of bluntly pointed and slightly upturned arms: the portion of the median 
band below these arms is somewhat narrower than that above. Type II is derived 
from I by suppression of the black portion below the transverse arms. In Type III 
the extent of the black colouring is yet further reduced by the absence of the 


upper half (or thereabouts) of the vertical band. In Type IV the lower part of 


the vertical band re-appears, but the width of all the components is very much 
less than in any of the preceding types. In Type V the component parts of the 
black marking cease to be in contact; the upper portion of the vertical band 
is interrupted by a broad belt of yellow; the two “arms” are separated from the 
lower par! of what remains and from one another; while there is no black at 
all below these remnants of the “arms ”—a feature recalling Types II and III. 
Types IV and V are however represented only by single individuals in the series 
examined, 

Between these main types certain intermediates occur. Thus some individuals 
have the black piece below the “arms” very narrow, approximating therefore 
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to II, but conforming to I if we take extension of the black right through the 
yellow as the criterion of I; such individuals are distinguished as 1+II. Others 
again conform to II but possess a slightly darker stain on the yellow in the line 
where the distinctive lower black portion of I might occur; these are called II + 1. 
Similarly, intermediates between II and III are recognisable: in II +- III the top 
of the upper portion of the vertical black band is very narrow; while in III + II 
there is a mere stain on the yellow of this region. A single instance occurs of an 
intermediate between I and III (1+ III), where the vertical black band extends 
right through the yellow, but is much narrowed at its upper extremity. 


Front of head of V. vulgaris ¢. 
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Eye 
Yellow 
Patch tn c 
orona 
bay of Eye 
Clypeus 
Mandible 
(All parts left white are actually yellow.) 


Clypeus alone 


HOG 


Types 


UIQ @ OC 


VI (4 VII) VIII (4 VIL) Vill 


Drawn by K. W. Merrylies. 


My first examination consisted of about 200 tubes containing queens of 
Vespa vulgaris from different nests. In the case of some queens the heads 
were missing, and in the course of transit the contents of some of the tubes 
had got loose in the jar. I have numbered these 199 to 208. The results are 
given in Table I (p. 204) and the summary below: 
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Type I . Pure ” 
= I+II 11; 62 
99 ; I+Ill 1 
Type II II+I =) 
_ Pure 94) 120 
‘ IT + Il 4| 
Type III III + I 1) 
ee Pure or 4, 
a IIl+1V 0) 
Type IV IV+I1II - | 
ss Pure am @F 
as IV+V o| 
Type V V+I1V 0) 1 
2 Pure 1J 


Total: 188 
It will be seen that transitional cases undoubtedly occur. The bulk of the 
queens, however, fall into Types I and II, or queens are very little variable. 
To test: (i) whether this variability was still further lessened by taking only 
the queens from a single nest, and (ii) the relative variability of queens, drones 
and workers, I now examined all the queens, workers and drones of a single nest 
> of V. vulgaris. 
In this case all the 127 queens were of Type II*. 
The classes of the workers are given in Table II (p. 205) and may be sum- 
marised as follows: 


Type I Pure 5) 10 
os v.s or (I+ II)? 5} 

Type II If+I 6) 162 
ea Pure = ions 


Total: 172 

It will be seen that they are somewhat more variable than the queens of the 
same nest, but not so variable as queens from different nests. 

I now turn to the drones of this same nest. I had 150 at my disposal. 

The drones exhibit a very wide range of facial markings. In the material 
examined comparatively few fall into the scheme of classification adopted for 
the queens and workers, and it thus becomes necessary to resort to six types 
of face which appear to be peculiar to the male sex. These are numbered VI, 


* There were 129 queens in this nest, but No. 34 was missing and No. 98 had its head damaged too 
badly for classing. 
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VI (4 VII), VII, VII (4 VIII), VIiI and IX, see diagrams, p. 202. 
there are two somewhat elongated black dots upon the yellow clypeus, one being 
sub-central, the other on the ventral margin; in VI (} VII) the ventral dot is 
longer dorso-ventrally and a third dot appears upon the left side (right side, in 
figure seen from in front) opposite the gap between the two previous dots; 


TABLE I. 


Types of Clypeal Markings in V. vulgaris Queens. 


NSIQfFWWNe 
= 


Jay J a Se I ot 








No. No. | 

| 
87 [+I] 130 | 
88 II 131 
89 I 132 
90 Il +] 133 
91 [ 134 
92 I] 135 
93 Lil+II 136 
94 - 137 
95 I] 138 
96 [] 139 
97 I 140 
98 Il+] 141 
99 [1+] 142 
100 I 143 
101 - 144 
102 I] 145 
103 I 146 
104 II 147 
105 LI 148 
106 149 
107 I 150 
108 I] 151 
109 | 152 
110 1+II 153 
11] IT +I 154 
112 I] 155 
113 I 156 
114 I] 157 
115 [I+I1I] 158 
116 II 159 
117 I] 160 
118 | 161 
119 I] 162 
120 Il+I1I] 163 
121 I] 164 
122 I] 165 
123 166 
124 I 167 
125 [] 168 
126 Il+I 169 
127 I] 170 
128a I] 171 
1285 I] 172 
129 | I+II 








Loose 


199 
200 
201 
202 
203 
204 
205 
206 
207 
208 
209 





In Type VI 
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TABLE II. 
Types of Clypeal Marking of Workers of a single Nest of V. vulgaris. 
| | 
| No. | No No. No No. | | 
| 
a —| | 
2 37 II 74 II 112 II 150 | II 
2 oe 38 II 75 II 113 II 151 | II+I | 
3 | as 39 I 76 II 114 II 152 | II | 
4 oe 40 II 77 II 115 II 153 nH 
ol - vate e|—~ im | 8 2m] 
6 | — 42 | II 79 I 117 II 155 II 
7 — 43 II 80 II 118 II 156 | II 
8 ms 44 | I 81 II 119 II 157 | II 
9 II 45 |Iv.v.s.] 82 II 120 II 158 | II 
10 II (I+II]}} 83 | I 121 II 159 II 
11 II 46 | II 84 | II 122 II 160 I 
12 | Iv.s. | 47 | II 8 | II 123 I 161 II 
([I+II] } 48 II 86 II 124 II 162 Il 
13 II a cae 87 | Il 125 II 163 | II | 
| 14 II 50 | II 88 II 126 164 | I1+I 
| 15 I 51 | II s9 | II 127 | Il+1 | 165 il 
| 16 II 52 II 90 | II 128 I 166 I | 
| 17 II 53 | II 91 | II 129 II 167 x | 
18 II 54 II oS. | =a 130 II 168 mn 
| 19 II 55 II 93 | II 131 Il 169 Il 
| 20 II 56 | I of | TT 132 II 170 iT | 
| 21 II 67 | II 95 | II 133 II 171 II 
22 II 68 | Iv.s. 96 II 134 II 172 II 
23 II fi+ini wi] — 135 Il 173 I 
“| II 59 I 98 II 136 | Il+I | 174 II 
25 I 60 II 99 ) II 137 II 175 II 
9 | II 61 Il 100 | II 138 | II 176 il 
27 | II 62 I 101 Il 139 | I 177 I 
28 | II 63 II 102 | II 140 II 178 II 
29 | II 64 II 103 II 141 II 179 if 
30 | Iv.s. | 65 I 104 | I 142 II 180 II 
|(i+0]} 66 | 105 | I 143 II 181 — 
Je oe 67 II 106 II 144 I 182 II 
32 | +i] 6 | Wm | 17 | mw fis | un | iss | 
33 | II 69 I 108 II 146 Il 184 II 
| 34 — 70 II 109 | II+I 147 II 185 — 
35 | dIv.s. | 71 | I 110 II 148 II 186 
(I+1I}} 72 | 111 II 149 | IL 187 II 
36 II 73 | I 

















in VII the two median dots are united by a slender black line, and there is a pair 
of lateral dots, right and left; in VII (4 VIII) the median line is of uniform 
width, extending from about the centre to the lower margin, and to its left side 
there is a single dot; in VIII the median line alone is visible, both lateral dots 
having disappeared; while in IX there is no continuous median line, but merely 
two black spots, one at the extreme dorsal and the other at the extreme ventral 
side of the clypeus. It will be noticed that Types VII—VIII approximate to 
Type IV in so far as the black stripe begins at about the middle of the clypeus 
and extends right down to the ventral margin. 
Biometrika x 27 
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The data are given in Table III (p. 207) and are summarised below: 


very narrow 6 
Type I 1a : VII) it 7 
Type II no horns 1, very narrow 1 2 
Type III : ; ‘ ; 0 
Type IV : 4 ; ; . 0 
Type V . ; 2 ; : 0 
Type VI Pure 2 a 60 

f VI+ VII 1 
Type VII ; : ; : . 22 

Type VIII (VIII near VI) 8 
. (VIII + 4 VID) 2? 58 

a Pure 48 
Type IX : 2 


Total: 151 


It will be realised at once how far more variable the drones, of even one nest, 
are than the workers or queens for this character. But their variability is rather 
of a negative than a positive character, appearing to consist in more or less extensive 
absence of the fuller markings of queen and worker. 


The results here deduced for variability of non-measured characters do not 
wholly agree with those found,by Wright, Lee and Pearson on the wing measure- 
ments of the same nest of V. vulgaris. They found that for absolute measurements 
the variability as determined by the coefficient of variation was in every case such 
that the worker was more variable than the drone and the drone than the queen. 
On the other hand they found when they dealt with indices that the drone for 
wing measurements was slightly more variable than the worker and the queen 
less variable than either*. Possibly the divergence apparent here may be explicable 
in the sense of the drone’s variability lying in the present case in an absence 
of marking rather than in any positive variation. The drone’s variation is about 
a centre of much diminished marking. If we could measure the variation in the 
total area of marking in queen and worker we might find it as great as the varia- 
tions in the smaller markings of the drone. 


It would be of much interest to investigate a series of drones from different 
nests. It is clear that the clypeal markings form a secondary sexual character 
and they would probably provide classifications for hereditary purposes. 


* Biometrika, Vol. v. pp. 414 and 421. 
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TABLE III. 
Types of Clypeal Markings in Drones of a single Nest of V. vulgaris. 








No No No, | No No 
l VIII 33 VII 63 VII 93 VI 123 Vill 
near VI 34 VI 64 | VI 94 VI 124 I 
2 VI 35 VIII 65 | Vil 95 VI narrow 
3 VIII 36 VI 66 | VIII 96 Vil 125 VI 
4 VIII 37 VI 67 | I 97 I 126 VIII 
5 Vi 38 VII | very very 127 Vil 
6 VI 39 VI | narrow narrow 128 VII 
7 VIII 40 | VII 68 V 98 VIII 129 Vill 
8 VIII 41 i Vi 69 VItl 99 VIII 130 Vill 
9 VIll 42 VI 70 VI 100 VI 131 VI 
10 IX 43 VIII 71 I] 101 VI 132 VIII 
11 VI 44 VI but no 102 VIII 133 VI 
12 VIII 45 VIII horns 103 VIII 134 VI 
13 VI 46 VIII 72 VIII near VI} 135 Vi 
14 VI 47 VI 73 VI 104 VI 136 VIII 
15 VIII 48 VIII 74 VIII 105 VIII 137 VI 
near VI 19 VII 75 VIII 106 Vil 138 I 
16 Vill 50 VIII 76 VIII 107 VIII dots of 
17 VI 51 VII 77 VI 108 VIII VII 
18 VI 52 VII 78 VI 109 VI 139 VII 
19 VIII 53 VIII 79 I 110 Vi 140 VI 
20 VII 54 Vi narrow 111 VIII 141 VI 
21 VII 55 I 80 VI 112 VIII 142 Vill 
22 Vill very 81 VI 113 VI 143 VIII 
23 VIII narrow 82 VIII 114 VIII near VI 
near VI} 56 [X near VI 115 VI 4.) 8 
24 Vil 57 I 83 VIII 116 VI 145 | VIII 
25 VII very 84 Vil 117 I] near VI 
26 V! narrow 85 Vi very 146 VIII 
27 Vi 58 VIII 86 VI narrow 147 VI 
28 VI near VI 87 VIII 118 = 148 VIII 
1 VII 59 VIII 88 VII 119 VII 149 VI 
29 VIII 60 VIII 8e VI 120 VI 150 VI 
30 VIII 61 VIII 90 VIII 121 VI 151 VI 
31 Vi 62 VIII 91 VII 122 Vi 152 VI 
32 VIII 1VII 92 VII 
4VII 
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TABLE OF THE GAUSSIAN “TAIL” FUNCTIONS ; 
WHEN THE “TAIL” IS LARGER THAN THE BODY. 


By ALICE LEE, DSc. 


In a paper published in Biometrika, Vol. vi. pp. 59—68, tables for the in- 
complete normal moment functions were printed, and they have since been 
reproduced in Tables for Statisticians and Biometricians recently issued from the 
Cambridge University Press. From these tables values of the Gaussian “Tail” 
functions were deduced and a short table of wy, and yy. appeared in Biometrika, 
Vol. vi. p. 68. The value of these functions being demonstrated in practice 
during the last few years, a more complete table of y,, W., Ww; has appeared in the 
Tables for Statisticians and Biometricians. 


In the introduction to those tables, however, Professor Pearson indicated that 
it was important to have a similar table when the “tail” forms more than half the 
entire curve, and gave the fundamental formulae for obtaining the numerical 
values of the functions. The present table has been calculated to supply the 
want thus indicated. 
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Let the figure represent a Gaussian curve of total population N and standard 
deviation «. Let AB be the ordinate at which it is truncated and let 
OB=h=Nh' xe. 
Let GH be the ordinate through the mean G of the truncated portion and BH =d, 
the distance of the mean from the line of truncation, let = be the standard 
deviation of the truncated portion about GH, and n=the area of the truncated 
portion, or of the population observed. Then if any material be supposed to 
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form a truncated portion of a normal curve, d, n and > can be found (see Tables, 
pp. xxvii and 25). 


We have 


* GE 5 ne oe (i), 
DR SCMIE pics tigi sed. cashes (ii), 
RC? | mee AT (iii). 


These are tabled for each value of h’, at first proceeding by ‘01 and then by 10 as 


unit. 


Vs. 


markedly bimodal distribution obtained by him for American bones. 


Now y, being known we find h’ from the table, and hence deduce yw, and 
vr. gives us the value of o from known d. 


Hence h=h' xo can be found, 
lastly (iii) gives us the total population from which n is drawn. Thus the constants 
N, o and h which fix the total Gaussian are determined. 

It will be sufficient to illustrate the method of using the tables on certain data 
as to the English thigh-bone, recently published by Parsons*. 


Dwight+ has adopted a method of sexing human femora on the basis of a 


He terms 


female any femur with diameter of head less than 45 mm., and male any femur 


with diameter of head over 47 mm. 


of 45 mm. and he gives 12 to ? and 8 to J’; 


Parsons follows this rule and sexes by other 
points femora with heads from 45 to 47. As unsettled remainder he has 20 femora 


of 46 mm. and 47 mm. he has 41 


femora and he gives 4 to 2 and 37 to f. As a result of this process he obtains 
a female frequency curve which rises very abruptly at high values of the diameter, 
and a male frequency curve which rises very abruptly for low values of the femora. 


But, if there really be any marked skewness in frequency of the parts of the 


human skeleton, which is very unusual, we should anticipate that it would be of 


the same sense. 

















The ¢ 48 mm. femur according to the 
male but presumably it had marked female 
male characters in any bone below 45 mm. ? 
remarkable dip in the total material at 46 


division. 


Parsons’ distributions are as follows (loc. cit. p. 256): 








characters. 
It will be seen that there is a 
mm. which corresponds to Dwight’s 
In material measured six years ago in the Biometric Laboratory, where 


50 


Pe yet 
19 


rule should have been treated as a 


Were there no marked 


every bone in a relatively large series was measured, no such dip occurs and there 


is in those data no justification for Dwight’s method of sexing. 


The group of 


29 f bones at 47 mm. and the sudden cut off at 45 mm. seems to condemn this 
method of sexing, at any rate from the statistical standpoint. 


* Journal of Anatomy and Physiology, Vol. xtvim. pp. 288—267. 
+ American Journal of Anatomy, Vol. tv. p. 19. 


t This material has been statistically reduced and will shortly be published. 
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Without arguing this point out here, we may illustrate the use of the Table 
(p. 214) of w’s by taking two of Parsons’ frequency distributions for females; we 
will cut them off at the points suggested, and then investigate the total popu- 
lations of females which result. Our author pools for these distributions right 
and left bones. 


Taking the diameter of “head of femur” for the females, we have 





Diameter in mm. ... | 36 





Frequency ... ee ee 

















These are exactly the bones the Dwight process gives as female. We find 


>? = 2°8851, 
d =2°6159 (measured from 44°5). 
Hence ap, = 2/d? = 4216. 


Whence by interpolation from the table 
h’="782, W.='864, W,=1:278, 


leading to o = 2'260, h=1°767, 
and Mean = 42°73 mm., N=88-2. 
Parsons gives for R. femur, Mean = 43, 


L. femur, Mean = 42, 


and the total number of bones dealt with 55 + 48 = 103 (Tables, loc. cit. pp. 249— 
251). In his frequency distribution (p. 256) he only records 85 female bones, 
which give a mean of 42°54 and a standard deviation of 2078 mm. These values 
are clearly not widely divergent from those we have found above by supposing 
all bones under 45 to be female. 


To test the matter further the 105* female bones of which the head was 
measured by Parsons were taken out. They provide the distribution: 














| | | | 
Diameter in mm. ... | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 
re OTeT Meee Sey Ee Pd | re cy, _| 
| Bt 5 2 a ON OR IEE 
| Frequency ... | 1] 21 )]—, 4] 10] 18| 16) 24|13| 14] 3 |}—] 1 
| Me oe be 
These give Mean 


Il 
> 
bo 
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~I 

Leen? 
‘ 
= 
ll 
—_ 
oO 
or 


$.D.= 1'°996 


* It is not possible to say whether he has omitted two queried measurements. He has not omitted 
bones he queries in breadth of lower articulation. 
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Cutting off all bones over 45°5 we find 
*>?=2°6392, d=2°6264, 
"ab, = 3826. 
Hence h’ ="984, v,=°783 and y,=1°195. 


leading to 


These provide for the non-truncated population, 
Mean 42°48, s.D.=2:056, N=104, 


which are in still better agreement with Parsons’ constants for the 105 bones than 
the constants for the 85 bones were for their series, It would appear therefore that, 
if we suppose all bones under 45 female and use our Tables, we get results in 
reasonable accordance with Parsons’, and possibly by a theoretically more justifiable 
method than endeavouring to sex the bones above 44 and below 48 from other 
characters. 

We have considered from the same aspect the character breadth of lower 
articular end of femur. Parsons’ distribution of 89 female femora is as follows 


(p. 257): 





Breadth in mm. ... | 60 | G1 | 62 | 63 | G4 | 65 | 66 | 67 | 68 69) 70'| 71 | 72) 73 | 7 


Frequency ee See RESUME SE ce ff SF ce ee ee ee 


In this Table he has only one bone in excess of the numbers on which he 
bases his means*on pp. 250—1. If we truncate at 69°5, Le. reject all bones over 


69 mm., we find 
=? = 3°9803, d=2°9058, 


and ap, = "4714. 


Hence we deduce 


These lead to 
h=1°503, o=2°839, N=984, Mean=68°00 mm. 
The actual values given by Parsons’ distribution above are 
o=2°571, N=89, Mean=67°54 mm. 

Thus the agreement is not nearly so good as for the diameter of the head ot 
the femur, being about 10°/, wrong in o and N. It should give as good a result 
if the method were quite satisfactory, for the bones have been sexed by the 
diameter of the head, and the limit 44 mm. for diameter of the head corresponds 
fairly closely to 69 mm. for the breadth of lower articulation. 


As this paper is not intended as a discussion of Parsons’ data, to which we 
hope again to return, we will only deal with one more illustration of the use of 


| 
| 


Diameter of Head ) 
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the Table. We take out from his Tables, pp. 244—248, the diameter of head of 


femur for 174 male bones. 





} 


Diameter of Head in mm. ... | 45 | 46 | 4? | 48 | 49 | 50 























Frequency ... nee wes | 9 | 10 
| | bess te EER SP ee: 





The constants of this distribution are 
Mean = 49°14, o=2:377, N=174. 
Truncating at 47°5 we have 
>? -= 3°4841, d =2°7869, 
whence yy, = ‘4422, and from the Table 
h’ =-679, W.="908, y,=1°331. 

These lead to h =1:7182 
and Mean = 49°22, o=2'530, N=162, 
ie. to a “tail” of 40 not one of 52 below 47°5. Actually this tail distributes itself 
as follows: 


Under 45 45 46 47 
Gaussian tail 5 6 12 17 
Against Parsons’ 0 9 10 33 


This confirms our previously expressed view that probably a considerable 
number of the bones classed as 47 mm. are really female femora, and that the 
male distribution runs considerably beyond 45 mm. into the range treated as 
purely female. 

Finally let us try the result of pooling male and female bones and breaking up 
the composite frequency by the method of Phil. Trans. Vol. 185 A, p. 84. 


We have now 279 bones distributed as follows: 











“penned 36\ 37| 38) 39 40 | 41\ 42) 43\ 44\ 45 \ 46 47 \ 48 49 | 50 | 51| 52| 53 54] 0 
ix 2; eek AURA Brot ae TA hn | EO IR! IEA eek 2) a i = 
Frequency «| 1] 1 410/18 16 24.13, 23  13|33| 18 | 38/20 1812/6 8/3 | 

The constants are Mean = 46°63, s.D. = 3°93, 
fy = 15°4040, bs =— 67791, 
pu, = 5415162, p= —530°5339. 





The nonic is 
q:’ — 5°9618q,’ + 0689q.' + 9°8357q,° — 3°4275q.4 — 8 2041¢,? — 30209." 
+ '0144g, — 0097 = 0, 
giving the root g.=— ‘934 and p,=— 9°34, 
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and ultimately the two components : 


Male Female 
Mean : : pia ah ; 49°83 43°72 
Population ; ‘ : 133°25 145°75 
Standard Deviation . ¢ : 2°231 2°662 
Max. Ordinate . : : ‘ 23°83 21°84 


While the means agree roughly with those obtained by Parsons’ sexing (49 and 
43), we see that this analysis much more nearly equalises the number of male and 
female bones, and indeed makes the female population rather larger than the male, 
while Parsons has 79°/, more males. The “truncated tail” method would probably 
give results in better accordance with the present had we not truncated at the 
quite arbitrary Dwight-Parsons’ divisions. 

These examples may suffice to illustrate the application of the Tables to 
anthropometric measurements on man, where we can feel fairly confident that 
the material, if sufficient in quantity, would be adequately described by a Gaussian 
or normal distribution. Such cases may arise when material for the two sexes, 
or for two races, is commingled and we can be fairly certain that one or other or 
both “tails” of the material present homogeneous parts of the mixture. 

Another illustration drawn from Galton’s data for American trotters will be 
found in the Tables for Statisticians, p. xxvi. The chief weakness of the method, 
besides the assumption of the Gaussian, often quite legitimate, is the absence as 
yet of the values of the probable errors, which values must be very considerable 
for slender material such as that used above. 


See following page for Table of Gaussian “Tail” Functions. 
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Table of Gaussian “Tail” Functions, “ Tail” larger than “ Body.” 


See 


Table of the Gaussian “ Tail” Functions 
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VIL. Suggestions for Future Studies in this Plant 


I. Inrropvuctory*. 


In the summer of 1905 Professor J. W. H. Trail drew my attention to an 
extraordinary example of variation which occurred in the several organs of the 
flowers of Lepidium Draba Linneus. At that time I examined in detail 1832 
individual flowers taken from a single plant growing in a piece of uncultivated 


* I am pleased to have this opportunity of expressing my great indebtedness to Dr J. F. Tocher for 
invaluable assistance in the biometric part of this paper. The correlation and other constants were 
calculated in his laboratory, and without his assistance the publication of this paper would have been 
greatly delayed. I must also thank Professor Karl Pearson, in whose department in University College, 
London, the statistical study was originally undertaken, for reviewing this paper for publication and also 
for much kindly criticism and advice. To Professor Trail my thanks are also due for many botanical 
hints. 
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ground in his garden, and the results of these observations form the basis of the 
present contribution to the study of the variation in the Crucifere. 


Botanical problems, which have been hitherto attacked from the biometric 
standpoint, have been comparatively easily handled, because the material has 
been more or less homogeneous in character. For example, variations in the 
number of sepals of Anemone nemorosa* or in the number of ray-florets of 
Chrysanthemum leucanthemumt, and the consequent distribution of these are 
capable of direct treatment by Pearson’s well-known method of fitting frequency 
curves, 


The only work comparable to the one in hand occurs in Biometrika, Vol. 1. 
p. 145 (Variation and Correlation in the Lesser Celandine), but in this case the 
numbers of members in the calyx, corolla and andreecium have been examined 
as a basis for a study of homotypic correlation and in this flower each of these 
organs consists of a single constituent with numerous members. 


The problems studied in this paper, however, are more complex inasmuch as 
they deal not with one organ of the flower but with all the organs, their con- 
stituents and members both separately and collectively. 


It is also, I believe, the first biometric work of its kind on a cruciferous flower 
and embodies a study of chorisis, that is, “the splitting up cr division o . © e or 
more components of a flower into two or more equal or unequal parts ”—a factor 
which is supposed to have been of the utmost importance in the evolution of the 
natural order—Crucifere. A complete discussion of this phenomenon is reserved 
until the flower is studied in detail. 





It would be well here to emphasise the fact that the flowers examined for this 
study were not taken from different plants but, on the contrary, were obtained 
from several inflorescences growing on stems which had arisen from buds on the 
roots of a single parent plant. This mode of reproduction is rather unusual, but, 
in the present instance, is of particular interest inasmuch as it gives greater 
homogeneity to the material. 

The parts of the flower which have been considered are (a) the perianth, which 
consists of (1) the calyx and (2) the corolla, (b) the andreecium and (c) the 
gynecium. 


The functional differentiation of these organs is of great importance in the 
interpretation of results so that it might be well to recall the particular réles 
which these play in plant economics. 


The gynecium and the andreecium are respectively the female and male organs 
of reproduction and consist of carpels and stamens, while the perianth forms a 
protective covering for these delicate structures. The calyx or outer organ of 


* Yule, Biometrika, Vol. 1. p. 307. 
+ Biometrika, Vol. 1. p. 309 et seq. 
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the perianth is concerned solely in the protection of the flower in the bud, but 
the corolla, in the open flower, also serves, along with the honey-secreting sacs at 
the base of the stamens, as an attraction for insects. 


The characters which have been taken as a basis for this study are numerical, 
e.g. the number of petals in the corolla, but no measurable characters, e.g. the 
length and breadth of the petals, have been considered, although, as will be pointed 
out later in connection with possible future studies in this flower, these characters 
might also with advantage be taken. 


The Cruciferae, as an order, are usually regarded by botanists as being very 
definite in type and no observations have been recorded to show to what extent, 
if any, deviation from the recognised botanical floral formula exists, so that the 
main object of this paper was to determine the frequency of the variability of 
the parts of the various organs and constituents, and also the degrees of correlation 
existing between the organs themselves. 


The mode of observation is worthy of remark, however, as it might well be 
argued that if the flowers used for examination were fully “blown” deficiency in 
the number of parts might be due to post-developmental fracture, but in all the 
cases here recorded the observations were made on flowers in bud or only half open 
so that the influence of wind or other external agency is altogether discounted. 
The material was also examined microscopically in all cases so that there should 
be no possible doubt as to the exact origin of any member. The importance of 
this will be seen in the details of the analysis. 


II. BOTANICAL. 
l. Specific characters. 


The generic and specific characters of Lepidiwm Draba may be obtained in any 
complete systematic botanical work so that it is unnecessary to repeat them here, 
but a few notes bearing especially on the study in hand may be of value. 


It is a perennial about a foot in height and is covered by a minute down from 
which its popular name, the hoary cress, is derived. The inflorescence is a raceme 
not much lengthened and so forms a broad, almost flat, corymb-like termination. 
The individual flowers are small, white and numerous. The constituents of calyx, 
namely the sepals, are green; they are short, nearly equal and bear no pouch at 
the base. The petals are small and white; they are equal in size, obovate, 
undivided and generally stalked. The stamens are six in number; the filament 
is simple, i.e. it bears no appendages, and is shorter than the petals; the anther 
consists of two roundish lobes. The pods are “broader than long”; they are com- 
pressed laterally at right angles to the narrow partition. The thick valves are 
boat-shaped and sharply keeled but not winged ; each valve contains a single seed. 
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2. Morphology of the flower. 
The typical flower consists of six whorls, made up in the following manner: 
(a) Calyx 2. 
(b) Corolla 1. 
(c) Andreecium 2. 
(2d) Gynecium 1. (Plate I, fig. 1.) 


(a) Calyx. This organ is composed of two whorls each consisting of two 
sepals. The outer pair arise at one level on opposite sides of the flower and are 
inserted on a slightly lower plane than the inner pair; they are parallel to the 
plane of compression of the gynecium. The inner pair are also situated opposite 
one another but in a plane perpendicular to that of the outer pair; they are thus 
at right angles to the plane of compression of the gynecium. These whorls are 
denoted on Plate I, fig. 1, by the Roman ‘numerals I and II respectively. 


(b) Corolla. This organ consists of four petals all inserted at one level and 
alternating with the position of the sepals; they thus constitute a single whorl. 


(See III, Plate I, fig. 1.) 


(c) Andracium. Six stamens form the andreecium ; they arise at two different 
levels and thus constitute two separate whorls. The outer whorl, which is lower 
down, consists of two stamens which are shorter than the others and corre- 
spond in position to the inner sepals. The inner whorl consists of four stamens, 
arranged in pairs which correspond in vosition to the outer sepals. (A reference 
to the figure (Plate I, fig. 1) in which the two whorls are marked IV and V 
respectively will make this clear.) 


(d) Gynecium. This organ consists of two carpels forming the sixth or 
innermost whorl. (See VI, Plate I, fig. 1.) 

It will be seen from the foregoing description that the order of the six whorls 
here detailed is that in which they would be found were we to strip the flower of 
its components at the different levels consecutively from below upwards. It is 
also the order in which we would find them, passing from the outside to the 
centre, were we to cut a transverse section through the flower. 

Another point, however, which is not so obvious but one which has special 
interest in our study, is the fact that this is also the order in time of development. 


The actual sequence in which these constituents of the flower appear in the 
bud is therefore : 


I. Outer whorl of Calyx (Sepals). 

II. Inner whorl of Calyx (Sepals). 

III. Corolla (Petals). 

IV. Outer whorl of Andreecium (Stamens). 
V. Inner whorl of Andrcecium (Stamens). 
VI. Gynzcium (Carpels). 
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3. Conception of Chorisis. 


Chorisis or reduplication is generally looked upon by botanists as a means of 
multiplication of the parts of a-flower. It consists in the division or splitting 
of an organ in the course of its development by which two or more organs are 
produced in place of one. Chorisis may take place in two ways: 


(1) transversely—when the increased parts are placed one before the other, 
that is, the resulting components are on the same radius; this is known as vertical, 
parallel or transverse chorisis ; 


(2) collaterally—when the increased parts stand side by side, that is, on the 
same circumference. 


Transverse chorisis is supposed to be of frequent occurrence ; thus the pistils 
of Lychnis and many other caryophyllaceous plants exhibit a small scale on the 
inner surface at the point where the limb of the petal is united to the claw. The 
formation of these scales is supposed by many to be due to the chorisis or unlining 
of an inner portion of the petal from the outer. 


Collateral chorisis is seen in different natural orders. In Strephanthus, in place 
of two stamens there is sometimes a single filament forked at the top and each 
division bears an anther. This is usually supposed to be due to collateral chorisis 
arrested in its progress. 

The flowers of the Fumitory are also generally considered to afford another 
example of this type of chorisis. In these we have two sepals, four petals in two 
rows and six stamens, two of which are perfect and four more or less imperfect. 
The latter are said to arise by collateral chorisis, one stamen being divided into 
three parts. 


Collateral chorisis may be compared, according to Bentley, to a compound leaf 
which is composed of two or more distinct and similar parts. 


Let us now consider chorisis in its bearing to the flower under consideration. 
In the description of the morphology of the flower we noted that in the inner 
whorl of the andreecium there were four stamens arranged in pairs while in the 
outer whorl there were only two stamens situated singly. Various opinions have 
from time to time been advanced to explain this anomalous structure so that it 
might be well to briefly review these. Of the andrcecium of the Crucifere Oliver 
says: “The two pairs of long stamens are generally thought to be due to chorisis 
or the division in the course of development of single antero-posterior stamens. 
Others have thought that the six glands represent abortive stamens and that these 
with the six stamens make up a normal series of twelve in three whorls.” 


De Candolle held the view that the stamens formed a single, originally 
tetramerous whorl alternating with the petals in which the median members, 
ie. the anterior and posterior, were cleft (chorised) in two. Since however the 
lateral stamens are inserted lower down than the median stamens and are also, 
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as already pointed out, formed earlier in the bud, this view is clearly untenable. 
Two whorls must be taken into consideration owing to the difference in the levels 
of insertion, the single stamens being lower down. Kunth, Wydler, Chatin and 
others regard these two whorls as typically four-membered (tetramerous), those of 
the outer whorl corresponding in position to the sepals, those of the inner whorl 
corresponding in position to the petals. To arrive at a typical cruciferous flower 
from this, two stamens in the outer whorl abort, while the individuals of the two 
pairs of the inner whorl come together. (Plate I, fig. 3.) 

Others (Krause, Wretschko and Duchartre) regard the outer whorl as typically 
dimerous (i.e. with two constituents) and the inner whorl as typically tetramerous 
(i.e. four-membered). 

The more modern view, however, regards both whorls as dimerous but the 
inner one chorised collaterally thus giving the typical cruciferous flower. 

The reasons put forward to support this theory are as follows: 

(1) The upper long stamens are usually paired in the median line, also 
sometimes coherent. Further, in place of one or both of the pairs, there occurs 
sometimes a single stamen—a hint at reversion, or one or both pairs may be 
replaced by three or more—a suggestion of further chorisis. 

(2) In the earliest visible stage of development in the bud it may be seen 
that each pair of stamens arises from a single wart-like projection and that division 
is therefore a secondary result. This is not very easily demonstrable in the 
Crucifere but is more evident in a closely allied family, the Capparidacez. 

Since the present study includes numerical variation in the different constituents 
and positions of the andreecium it will be interesting to note to what extent any 
one of these theories is borne out by the variations in this flower. 


4. Orientation of the flower. 

Having defined the positions of the various stamens relative to one another, 
in what is usually regarded as a normal cruciferous flower, let us now consider the 
different possibilities when the flower is abnormal. 

Suppose that one of the pairs of stamens of the inner whorl is represented by 
a single stamen, that is, suppose that chorisis had not taken place. Now with 
regard to the peduncle of the inflorescence this stamen might be placed in two 
diametrically opposite positions, namely (1) it might be adjacent to the peduncle 
(Plate I, fig. 4) or (2) it might be on the distal half of the flower with reference 
to the peduncle (Plate I, fig. 5). 

Two questions now arise, (1) do non-chorised stamens occur as frequently as 
chorised stamens on the side of the flower next to the peduncle? or (2) do either 
of these occur with greater frequency in this adjacent position ? 

According to which of these questions is answered in the affirmative must we 
conclude whether there is any connection or correlation between the proximity of 
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the chorised stamens to the peduncle and chorisis. The former would suggest no 
correlation, whereas the degree of correlation hinted at by the latter would depend 
on the frequency of the occurrence. 


We have so far considered only two possible positions, viz. a non-chorised 
stamen adjacent to the peduncle, ie. in the proximal half of the flower, and a 
non-chorised stamen opposite to the peduncle (i.e. in the distal half of the flower 
with reference to the peduncle), but the question naturally arises “ Are these the 
only two possible relative positions which might occur?” Might the petiole not 
twist so as to bring the hypothetic non-chorised stamen into any position varying 
from 0° to 180° with reference to the original plane ? 


Let us illustrate this by means of the Figure 6, Plate I. 


Taking the position of the peduncle as our fixed point the non-chorised stamen 
might occupy the “adjacent” position a or the ‘opposite” position al. A rotation 
of the petiole, however, might cause this stamen to occupy any of the positions 
marked a2, a3 or a4 or even any intermediate position between a and al on either 
side of the vertical plane A—B, in the horizontal plane a, a4, a2, a3, al. 


In a study of the variations in this flower, this is precisely what was found to 
occur, i.e. the distribution was equal round a fixed point so that we are unable to 
say whether there is any connection between the proximity of the non-chorised 


stamen to the peduncle and chorisis or not. 


But the full bearing of this consideration does not end here.- The orientation 
of the flower is of practical importance in fixing a basis on which to establish 
a grouping of the different variations. Any analysis of the data is impossible 
unless some definite part of the flower be agreed upon as a starting point. 


Now we have seen that the position of the peduncle with respect to any 
definite stamen does not require to be taken into consideration. Consequently 
we may take either of the two stamens of the outer whorl, which correspond in 
position to the outer sepals and which are “normally” non-chorised, as our fixed 
point and call it 1; the stamen opposite, ie. in the same whorl, we shall call 2; 
the chorised pair of the inner whorl to the left (or in the floral diagram above) 
may be termed 8 and 4; while the corresponding pair to the right (or in the floral 
diagram below) would thus be 5 and 6 (Plate I, fig. 7). 


Where variations occur in any of these stamens we shall hereafter refer to 


those as occurring in “position” 1, 2, 3, 4 and 5, 6 respectively. 


On this basis of symmetry, it will simplify matters considerably if we regard 
as 1, in flowers in which either of the two outer stamens is modified, that one 
which still maintains its original character while, on the other hand, if both are 
modified, that one which retains the greatest approximation to normality, e.g. if 
one be chorised while the other is not, the latter would be in position 1; or if one 
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were chorised while the other was only partially chorised* the latter would again 
be in position 1. 


Following on this it is at once seen that where both are normal or where both 


are equally abnormal it makes absolutely no difference which position we choose 
as 1, 


III. EXAMINATION OF THE DATA. 
1. Classification. 


Considerable difficulty was experienced in classifying the variations owing to 
these occurring in so many different forms yet with so few characteristics in 
common as to warrant their inclusion in definite classes. 

The total number of flowers examined was 1832, of which 1062 had the 
accepted normal structure (see page 218). The remaining 770 showed variation 
in different degrees of advance or regression, i.e. there was an excess or deficiency 
in the number and structure of the members of the various organs. Thus we see 
that there was a deviation from the accepted normal structure in over 42 per cent. 
of the individuals examined. 


The perianth has been selected as a basis for classification and Table A shows 


the sub-divisions which have been adopted. Amongst those flowers in which the 


TABLE A. 








Number | Number Number Number | Variations 
of in in in in the 
Variations Group | Sub-class | ~ Class Class 
| | 
Class I. Perianth normal ... ass see rea —- — 1687 — 
Sub-Class A. Gynecium normal see oe - — 1680 ~ - 
Group (a). Andrecium normal : sen] l 1062 -— — 
Group (6). Andrecium abnormal ... nel 57 618 ~ 
Sub-Class B. Gynecium abnormal _ -- : 
Group (a). Gyneecium one carpel ... eas 2 4 — 
Group (d). Gynzcium reduplicated 2 3 - ; 62 
| Class II. Perianth abnormal ee Ue aed - — 115 
Sub-Class A. Calyx normal, corolla abnormal 55 
Group (a). Gynzcium normal fe bas 1] 54 
Group (0). Gyneecium a single carpel ea l l 
Sub-Class B. Both calyx and corolla abnormal 60 
Group (a). Gynzcium normal em mel 11 46 : 
Group (6). Gynecium a single carpel x 6 14 - - 29 
SS a et eae Ur 91 1802 1802 1802 91 


* For the present we use the terms ‘‘chorised” and ‘‘chorisis” in the sense of the definition already 
given. 











wv. J. SIMPSON 223 


perianth was normal there were no fewer than 62 different types of variation, and 
amongst those in which the perianth showed a departure from the accepted 
normal structure there were 29 types of variation. Thus of 1802 flowers examined, 
1062 had the typical cruciferous structure, 625 had the perianth normal but the 
andreecium and gynecium modified in 62 different ways and 115 had all three 
organs modified in 29 different types of variation. 


The remaining 30 individuals are not capable of classification under the fore- 
going scheme but have been grouped into three classes as shown in Table B. 


TABLE B. 

Number | Number | 

of of individuals 

Variations | in the Class | 

| 

Class III. Reduplication of parts but flowers not separate ... | 10 11 
Class IV. Reduplication of parts with flowers separate... 6 | 17 
Class V. Part of a flower replaced by a flower oa oe 2 2 
Totals”... ae = we an ae a | 18 30 | 





Altogether, therefore, there are five separate classes which give a total of 109 
different modes of variation. 


2. Analysis. 


In the further reduction of the data it is essential that we consider the 
variations in the stamens, and for this purpose we must naturally commence 
with Class I, Sub-class A. 


To avoid describing each of these in detail, it is necessary to have recourse to 
a graphic method of representation. Several such methods suggested themselves 
and although none are ideal we have chosen one which may help to give a true 
impression of the various modifications assumed by the andreecium. We shall 
also give a few examples by another method which might have been adopted but 
which seems to us to be even more complicated. 


Let us, in the first place, consider in what directions abvormalities have 
occurred, A typical stamen consists of two parts, (1) the filament and (2) the 
anther. 

(1) Filament. This may be of its normal length or less than its normal 
length or altogether absent. 

(2) Anther. This may be present or absent. 

But other complications arise. As already explained, in the accepted typical 
cruciferous flower, chorisis has taken place in positions 3.4 and 5.6 so as to give 
29-2 
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rise to two stamens in each of these positions. Now, we find that, in certain 
flowers chorisis has only partially taken place and in others it has not occurred 
at all so that we have thus another three possibilities to consider. 


In describing the andreecium, therefore, we must (1) define the position of 
each stamen to which we refer, (2) state the nature of the filament, (3) note the 
presence or absence of the anther and (4) emphasise the nature of the chorisis. 


Let us use the following symbols 1, 4 and 0. 


Filament, indicates that it is present and complete. 
1 with reference to }Chorisis, indicates that it is total or complete. 
aad indicates that it is present. 


. (Filament, indicates that it is only half-length. 
4 with reference to {,, .. . 3. See : 
(Chorisis, indicates that it is only partial. 
(Filament, indicates that it is absent. 
0 with reference to + Chorisis, indicates that it has not taken place. 
\Anther, indicates that it is absent. 

We have already fixed upon our nomenclature for the various positions; these 
are 1; 2; 3.4; and 5.6. To avoid descriptions and at the same time give a 
graphic representation of the floral formula of the andrecium the following 
system might be adopted: 

(1) Place the whole floral formula within square brackets thus [ _ ]. 

(2) Place positions 1; 2; 3.4; and 5.6 within curled brackets thus { }; and 

(3) Place individuals, i.e. 1, 2, 3, 4,5 and 6, within rounded brackets thus (_). 

Expanding this with reference to a normal flower we would have for the 
andreecium only 

19) {9} (9) 
{2} {2} {25], 
or still further in the order of Filament, Chorisis, Anther, Stamen 
if f lf 1f \) 
[{(1.0.1)(1.0.1)} {.1.1)(1.1.1)} (1.1.1) (01.1. )}). 

Or, taking an actual example from our data : 

Stamen number 1 is normal and complete, and there is no chorisis; stamen 
number 2 has a filament only half-length but the anther is present and complete, 
and there is no chorisis; stamen number 3 is normal and complete; stamen 
number 4 is only half-length but with a complete anther—chorisis between 
3 and 4 is complete; stamens 5 and 6 are only half the normal length but have 
complete anthers—chorisis between 5 and 6 is complete. 


This would be represented thus: 


[{(1.0.1)(4.0.1)} (1.1.1) (§.1.1)} (4.1.1) (4.1. 1)}). 


| 
/ 
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Another graphic method and the one which we have adopted is as follows. 
Each flower is represented in a table similar to the following: 


Ber oe Tint ee See . ; ie. 2 See ee 


rs = 
Number of | 
| 


| Sen | Diagram Stamen | Filament | Chorisis Anther | 


SESS bee 


Se Sas ae 








The first vertical column gives the frequency of the variation or the number 
of individuals examined with this structure. The second vertical column gives 
the number of the corresponding diagram in the plates. The third vertical 
column gives the individual stamens in the positions already defined while the 
other three columns denote the various factors to be considered. The different 
possibilities of variation in these may be shown by the symbols 1, $ and 0 as 
already defined. It should be noted, however, that in positions 1 and 2 a dash 
(—) will be placed in the chorisis column to indicate that these are typically 
non-chorised stamens and that absence of chorisis does not therefore indicate 
abnormality. Representing the same example as before, by this method, we 
would have: 


Number of 


Frequency Diagram Stamen Filament Chorisis | Anther | 
pe ke Awe ie 
| | | 
1 1 | — 1 
“Wei cee Sere Sle 
a. ae 1 1 1 
eae ee at 
| 5 $ 1 1 
6 3 1 1 


The following table shows graphically the types of variations illustrated in 
Figs. I—LVIII, i.e. Class 1, Sub-class A, flowers in which the perianth and 
gynecium are both normal. 

It will be seen that in the flowers illustrated in Figs. XLVIII—LVIII another 
complication has crept in. Stamens 3, 4, 5 and 6 have themselves sometimes 
undergone partial or total secondary chorisis. In the tables, therefore, by sub- 
dividing the squares containing the details we can thus adhere to our initial 
nomenclature. Let us take the three most difficult examples to illustrate this. 

(1) Fig. XLVIII. The division corresponding to stamen 3 is sub-divided. This 
would indicate that in this position there were actually two stamens. The nature 
of each of these individual stamens is, as before, given in the sub-divisions. In 
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this example what really occurs is: There are two individual stamens each equal 
to the original length and bearing a complete anther and separated from one 
another by secondary chorisis. 


(2) Fig. L. Divisions 5 and 6 are sub-divided to show that there has been 
secondary chorisis in both of these stamens. In the former, chorisis has been 
complete but has resulted in one being fuil-length and with a functioning anther 
while the other is only half-length with a functioning anther. In the latter, 
chorisis has not been complete inasmuch as only the anther has been chorised. 


(8) Fig. LII. Divisions 5 and 6 are both sub-divided, consequently we may 
infer that both of these stamens have undergone some stage of chorisis. In the 
first column we see that all are full-length, in the third that all have functioning 
anthers, but the second tells us that chorisis has been only partial in each case. 
The symbol $ between 5 and 6 indicates that between these two chorisis has also 
been partial. Thus we conclude a state of affairs as follows: In position 5.6 
(1) there arises a single filament which divides into two at some distance from 
the base and (2) that each of these again sub-divides and (3) that on the end of 
each of these four sub-filaments there arises a functioning anther. The others 
may be worked out in a similar manner but a reference to the diagrams will at 
once obviate any misrepresentation. 


From the foregoing table and illustrations it is evident that further classi- 
fication is possible but it would be well to point out here certain difficulties 
which arise. As an example let us consider such a case as (using our original 
terminology) that in which, in any of the positions (1; 2; 3.4; or 5.6), the stamens 
are represented thus (1.1.0) (0.0.0), thus (4.1.1)($.1.1) or thus (4.0.1) (4.0.1). 
Which shall have precedence? If we are to consider these variations as deviations 
from the usually accepted normal cruciferous flower, then we may safely assume 
that that flower which has the greatest number of functioning parts in a certain 
position is less aberrant than one in which any or all of the parts are altogether 
wanting ; while, on the other hand, if in a position in which chorisis normally takes 
place, we have defective groups like those in cases 2 and 3 cited above, in one of 
which chorisis has taken place but not in the other, we must consider that group 
in which chorisis has occurred as being the one less removed from normal. On 
this basis then the above examples would be placed in the following order with 


regard to normality : 


(1) (4.1.1) (4.1.1); (2) (4.0.1) (4.0.1); (8) (1.1.0)(0.0.0). 


Similarly for any of the others. 

Consequently we are now in a position to classify the actual cases under 
observation. 

So far we have considered only those flowers in which there was the typical 
number of stamens, with their manifold variations in size and structure, but now 
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we must classify those in which secondary chorisis has given rise to more than 
the accepted number. 4 
¥ 


Let us take stamens 5 and 6 as our basis, i.e. those individuals in which 
position 5.6 is occupied by more than two stamens. 


The relative frequencies of the different types of variation in the andreecium 


in the 618 specimens so far considered (see Table C) are very interesting. 
The number 1062 in Table F refers to 1062 flowers in which the andreecium was 


TABLE D. 
} 
Number of | Number of 
Variations in | Individuals in 
mo; 
Section | Peco | Section bene | 
: | | 
Group a. | 
Whole of the andreecium normal. Fig. I ne oan “i es waa i ee 
Group 6. | 
Andreecium variously modified ... ae aes a we as aay — —}—]}]— 
Sub-group a. 21 — | 480 
Outer whorl normal (5 and 6 variously modified)... are ae eee — — — 
Section i. 
Stamens 3 and 4 normal. Figs. 1I—IX e a = — 8 |} — 419 | — 
Section ii. 
Stamen 3 normal, 4 represented thus ($-1-1). Figs. X—XIV ned 5 — 2 | — | 
Section iii. 
Stamens 3 and 4 thus {(1-0-1) (1-0-1)}. Figs. XV—XVIII ... = 4 | — | 9 — 
Section iv. | 
Stamens 3 and 4 thus {($-1-1)($-1-1)}. Fig. XXII... Re ree l — | 2); — 
Section v. 
Stamens 3 and 4 replaced by one. Figs. XIX—XX1 ... oe ne 3 ee eee 
Sub-group 8. is. |; — | 84 
Outer whorl represented thus {(1-—-1)(4-—-1)}... ae ~~ ds — ee eh a 
Section i. 
Stamens 3 and 4 normal. Figs. XXIJTI—XXIX as aa nae 7 — 42 — 
Section ii. 
Stamens 3 and 4 thus {(1-1-1) ($-1-1)}. Figs. XXX—XXXIV poke ee — 11 — 
Section iii. 
Stamens 3 and 4 thus {(4-0-1) (0-0-0)}. Fig. XXXV ... ie ot we -- l = 
Sub-group y. 
Stamens 1 and 2 represented thus {($-—-1) (4-—-1)} ... Br A. 4); — | 16 
Section i. 
Stamens 3 and 4 normal. Figs. XXXVI and XXXVII 5 oe ee — } ll — 
Section ii. 
Stamens 3 and 4 thus {(1-1-1) ($:1-1)}. Figs. XXXVIII and XXXIX | 2 | — | al ie 
Sub-group 6. 
Stamen 1 normal, 2 absent ... eve ee gu re =P me _ af ee 
Section i. | 
Stamens 3 and 4 normal. Figs. XL—XLIII ... Pee ae a a oe ee 
Section ii. | 
Stamen 3 normal, 4 thus (4-1-1).- Fig. XLIV ... fits te = 1 — 2 | 


Sub-group e. | 
Stamen 1! thus ($-—-1), 2 absent. Figs. XLV—XLVII eas co . = | — | 6 
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TABLE E. 


Number of Number of 


Variations in | Individuals in 





| group 
| Sub-group ». 
Stamens 1 and 2 are normal Zo ee as os bcecd 9 a 
Section i. 
Stamens 3 and 4 are represented by three. Fig. XLVIII _... 1 — 3 
Section ii. 
Stamens 3 and 4are normal. Figs. XLIX—LI ‘0 bog 3 — 16 
Section iii. 
Stamens 3 and 4 represented thus {(1-1-1) ($-1-1)}. Fig. LII i ea 2 
Section iv. 
Stamens 3 and 4 thus {(1-0-1) (1-0-1)}. Figs. LIII—LV ... 3 |; — 5 
Section v. 
Stamens 3 and 4 represented by one. Fig. LVI aan = 1 4 
Sub-group 6. 
Stamens 1 and 2 thus {($.—-1)($-—-1)}. Fig. LVII Son 1 a 
| Sub-group x. 
Stamen 1 normal, 2 absent. Fig. LVIII eee bee ae 1 — 


TABLE F. 


Frequencies more than 3 in order of magnitude. 


(References have been made to the figures.) 


Figure Frequency Figure Frequency Figure Frequency 
I 1062 XXVIII 1] XXV 7 
VIII 227 XXXVI 9 Vil 6 
ITI 130 VI 8 xX 5 
IV 38 XII 8 XLIII 5 
XX 21 XIII 8 LI | 5 
XL 18 XLIX 8 XVIII 4 
XXIII 15 IX 7 LVI 4 








} Section | Sub- Section 





Sub- 
group | 


30 


normal. Where variation occurs, the greatest frequency, namely 227, occurs in 


flowers in which one of the pairs in the inner whorl is replaced by a single 


stamen while the next highest frequency, namely 130, occurs in those flowers in 


which partial chorisis has taken place in the inner whorl of the andreecium. 


we find that stamens 1 and 2 are involved. 


30—2 


Following this the magnitude of the frequencies diminishes rapidly. The next, 
namely 38, occurs in flowers in which nearly all the parts of the andrecium 
are modified while, near this, is the frequency 21 which exists in flowers having 
only one stamen in each position. In the next two frequencies, namely 18 and 15, 
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From these raw data we can see that the inner whorl of the andrceecium is the 
whorl most subject to variation and further that this variation is in the direction 
of a decrease in number. 

TABLE G. 


Class I, Sub-class B: Perianth normal, Gyneciwm abnormal (see Table A). 





> ~ > | > “s > 
a ae - Eg a - 2 ee z be 2 ~ 2 % ~ 
5 os 5 2 L 2 2 sy oS 2 ps) = 
ere 1 a et eA” a a 
| 1 l 1 l 1 
2 1 — 1 2 l — 1 
. 3 ] l l , 3 1 1 1 
eal tr an ee ee ee ee ee 
5 1 0 1 | 5 1 l 1 
6 0 0 0 Ta oie l 1 
_ | ee | 
, 1 1 1 wo 1 
L l l 2 | Y — 1 
. l l l 3 l 1 1 
2 1 1 ok l 1 
: 3 1 0 l . ae 1 1 
l LXII Ff : 0 2 | LXIII 5 | , 1 1 
l 1 l 6 ca l 1 
| 5 ] l l l ] 1 
1 l 1 
| 6 l ae ee 





Class II: Perianth abnormal. 

Variations in the members of the perianth (calyx and corolla) have necessitated 
the introduction of new symbols in the diagrams. These are shown in the com- 
posite diagram Plate I, fig. 10, and are explained on p. 257. 

It will be evident from Table H, p. 233, that the same type of variation in the 
andreecium occurs with different types of variation in the perianth, e.g. in the 
second and fifth figures no fewer than six different variations in the perianth 
accompany a single type of variation in the andrecium. Reference to the 
diagrams in the plates will show what these variations are and will render a 
detailed explanation unnecessary. The asterisk in Table H under LXXVIII 
indicates that there has been adhesion between stamen 1 and one of the 
stamens in position 3.4, in other words between one of the members of the 
outer whorl and one of the members in the inner whorl. 


Class ITI. 

The members of this class are characterised by a reduplication of the various 
organs but without separation into two distinct flowers. There are in all 11 
individuals with 10 different types of variation. A word of explanation is 
necessary with regard to the interpretation of the position of the various stamens 
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The stamens in the outer whorl are longer than those in the 


Consequently if these were .reduced in length they might easily be 
mistaken for members of the outer whorl. 


In all cases of difficulty, however, the 


crucial test, the point of origin, was applied and the positions assigned to the 


various members as shown in the diagrams were determined microscopically in 


this manner. 


See Table I, p. 234. 
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Class IV. 


In this class there are 17 individuals giving six different modes of variation. 
Reduplication has taken place to such an extent as to give rise to two separate 
flowers on one pedicel. Each of the flowers was diminutive in size. Two tables 
are thus necessary for each “flower,” A and B: see Table J, p. 235. 


Class V. 


This class has been formed to include two very aberrant flowers showing two 
distinct variations. In both cases part of the flower has been replaced by another 
flower, in one case normal in the other slightly divergent. CIX is one of these 
in which the original flower is normal except that one of the carpels has been 
replaced by a small flower (see Table K and diagram, Plate X). CX is the other. 
In the original flower stamen 1 has been chorised and one of the chorised parts 
has given origin to a separate flower (see Table K and diagram, Plate X). 


TABLE K. 








Jlass V. 
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The asterisk denotes the position of the origin of the secondary flower. 

This concludes our analysis of variations LIX to CX both as to perianth and 
andrecium, but before proceeding to the statistical part it is desirable that certain 
peculiarities should be observed and that an understanding. be arrived at with 
regard to the interpretation of these. 
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In this procedure 44 variations, namely LIX to CII, must be dealt with. 
When we study the number of parts which occur in the position of individual 
members of a whorl and then try to draw conclusions as to normality or abnormality 
of the whorl itself we find the following difficulties. 


Let us take the outer whorl of the andreecium as an example. 


(1) If in the position normally occupied by stamen 1 there were two stamens 
and in the position normally occupied by stamen 2, no stamen occurred, then with 
regard to the whorl the total number of stamens would be two. Now this is the 
accepted normal number of stamens in the outer whorl, so that if number alone 
were considered the inference would legitimately be drawn from the table that the 
whorl was normal. But this is not so! 


Or (2) If in position number 1, one normal stamen occurred and in position 
number 2, one functioning stamen, with the filament only half the normal length, 
occurred, then the number of functioning stamens in the whorl would be two, i.e. 
the accepted normal number. But again, on the basis of number alone, we should 
not be able to say whether the whorl as a whole was normal or abnormal. 


Now as this state of affairs exists not only in the whorl under consideration 
but in all the whorls of the flower, we have thought it not only advisable but 
necessary to emphasise these abnormalities as a safeguard in the interest of 
systematic statistical treatment. 


For this purpose, therefore, small diagrammatic formule have been drawn up, 
and these have been given in conjunction with the diagrams: see Plate I, figs. 
11, 42. 


We have already defined the positions of the various parts of the androecium 
but have hitherto refrained from naming the different constituents of the perianth. 
In the cases under consideration, however, it is necessary to do so, and Plate I, 


fig. 11 illustrates how these are definitely determined. 


The two outer sepals are named A and B (see Fig. 11). A corresponds in 
position to stamens 3.4 and B to stamens 5.6. OC and D are the two inner 
sepals ; C corresponds in position to stamen 1 and D to the stamen in position 2. 
The petals are named A’, B’, C’ and D’ and lie respectively between sepals A and C, 


Band D, A and D, and B and C. 


The actual order of all the parts is summed up in Plate I, fig. 12 (1—14). 
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pare 


IV. STATISTICAL. 


The analysis which we have given of 1813 flowers is sufficient to show that the 
idea of a definite fixed number of sepals in the calyx, of petals in the corolla, of 
stamens in the andreecium or of carpels in the gynzcium of cruciferous plants is 
not upheld by an examination of a large number of flowers of this species. In 
less than 1 per cent. of the flowers examined there was an increase* or decrease 
in the number of sepals in the calyx; in less than 1 per cent. there was also an 
increase or decrease in the number of petals in the corolla, but in 2 per cent. there 
was an increase in the number of stamens in the andreecium, while in 22 per cent. 
there was a decrease in the number. 


Since then the number of sepals, petals and stamens is not absolutely fixed for 
any of the organs it becomes necessary now to consider whether the number of 
members in one organ is related to the number in the others. 


As has already been pointed out we have not only to consider organs as a 
whole, but, in the case of the calyx and the andreecium, the constituents of these 
organs, owing to the fact that these organs are each divided into two separate 
whorls which are inserted at different levels and are placed in directions at right 
angles to one another. 

Further, a special study has been made of the various positions in andreecium 
to ascertain to what extent bilateral symmetry may be regarded as an inherent 
character of the flower under consideration. 

By this means also it seems that some definite information might be obtained 
with regard to the perplexing and, at present, hypothetical theory of chorisis, the 
reasons for the existence of which have been summarised on p. 219. 

The statistical part has been divided into two sections: 

(1) astudy of the Means and Standard Deviations, and 


(2) a study of the Correlation Coefficients. 
1. Study of the Means and Standard Deviations. 


Although it is obvious from the analysis of the data under consideration that 
the numbers given for the botanical floral formula, namely, Calyx—4, Corolla—4, 
Stamens—6 and Gynecium—2, are the nearest integers, it is not at all certain 
from a mere inspection of the tables whether the actual means deviate from this 
number in the direction of excess or deficiency. 


* Where chorisis of a sepal or petal has resulted in two or more distinct individuals we have regarded 
each of these as a distinct sepal or petal in recording the numbers. This method is .atural however 
inasmuch as it is the only means by which we may possibly trace reduplication of parts. 




























constituents have been calculated and these 
TABLE M. 
Means and Standard Deviations of the 
Constituents. 
. Mean 
Organ Constituent Member eal 
Calyx 3°9796 
os Outer whorl 1°9757 
§ = Inner whorl 20039 
Corolla ... 3°9520 
Andreecium 58092 
a Outer whorl 1°9570 
= Stamen 1 “9840 
2 Stamen 2 ‘9713 
- Inner whorl 3°8577 
“ Stamens 3.4 19950 
AA Stamens 5.6 | 1°8627 

(1) The most obvious result which is revealed by these constants is the fact 
that in all cases (except the inner whorl of calyx) the actual mean of the organs 
is less than the recognised typical number, thus: 

The mean number for the calyx is 3°979 instead of 4. 

The mean number for the corolla is 3°952 instead of 4. 

The mean number for the andreecium is 5°809 instead of 6. 

(2) The inner whorl of the calyx shows the smallest departure from the 
accepted typical number, namely, 2°004 instead of 2. 

Let us, however, test how far the differences in the character of the analogous 
parts are significant by ascertaining the Probable Error of the difference of the 
means of the characters. 

TABLE N (1). 
I. Constituents of the Calyx. 
Oeiistiiinn Mean Standard 
OnavIEuEnS Number Deviation 
Outer whorl 1:9757 "1979 
Inner whor! 2°0039 *1304 

The difference here is D=°‘02813 and the 

E, ='0037; thus the value ——=T6. 
; Din am 
: m 


significant. 
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Standard 
Deviation 


*2553 
‘1979 
"1304 
*3523 








ate 


‘4863 





Consequently the mean and standard deviation for each of the organs and its 


are given in the following table: 


Number of the Organs and their 


Coefficient 
of 
Variation 


6°415 
10°020 

6°507 

8°914 
13°025 
13°817 
16°280 
19°715 
17°077 
13674 
26°107 


probable error of the difference 


The difference is therefore clearly 
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It is worthy of notice that the outer whorl of the calyx is more variable than 
the inner whorl and that it possesses on an average fewer sepals. 
TABLE N (2). 
Il. Members of the Outer Whorl of the 


Andrecium. 








Mernhae Mean Standard 
ee Number | Deviation 
Position 1 ie *9840 “1602 


Position 2 we 9713 | “1915 





: : D 

The difference here is D = 01268 and Zp, =‘00391; the value a 3°2. 
m D 

Thus when the two positions of the outer whorl of the androecium are taken 


m 


into consideration, a probably significant difference is found between the means of 
the distribution of the parts of this whorl. Now in position number 1 there is 
a greater approach to the accepted type owing to the fact that when the com- 
ponent of one of the positions of the outer whorl was found to depart from the 
accepted type, the other position was selected as the starting point for the 
orientation of the flower and was called position number 1. It is all the more 
noteworthy that the deviation for position 2 is not in the direction of greater but 
of lesser frequency and the variability of position 2 is greater. We have thus 
again a reduction in the value of the type with greater variability. 


TABLE N (3). 
III. Members of the Inner Whorl of the 


Andrecium. 


Mean Standard 


Member Number Deviation 
Position 3.4 ... 1°9950 2728 
Position 5.6 ... 1°8627 *4863 
. Mm—™M . " a ee — 
Here the ratio =~——— is nearly 15 and therefore there is yuite a significant 


— (m,—mMe2) 


difference between the means of the distributions of the two members of the inner 


whorl of the andreecium. In both cases the tendency is towards a suppression of 


functioning stamens rather than an increase, together with greater variability 
in the case where the reduction from the accepted type is more marked. 

This difference in variability is, in the main, real and is not due to the arbitrary 
selection of the 3.4 position. This will be evident from a study of Table XIX. 
It will there be seen that there were 1754 cases where two stamens occurred in 
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one of the positions of the inner whorl of the andreecium. This is the number 
in the accepted type, and thus there is no variability. What is the nature of the 
distribution of the stamens in the other position (Table XVIII)? It is as follows: 


1 2 3 a 


287 1436 20% | 5 | 1754 


The mean for this array is 1°8569 stamens, with a variability of -4116. Thus 
when there is no variability in one position of the inner whorl of the andreecium 
there is a large variability in the other position. 

Similarly we find the following distribution for position 2 in the outer whorl 
of the andreecium when position 1 is of the accepted type, i.e. shows no variability. 


57 1708 1 1766 


The mean for this array is ‘9683 with a variability of 1784. Again therefore, 
when there is no variability in position 1, there is a reduction of type in position 2 
with great variability. 


2. Study of the Correlation Coefficients. 


For the purposes of this study a number of correlation tables have been 
prepared and as the results of these will have to be considered under different 
groupings it seems advisable to tabulate them, and insert them consecutively. 
The system which has been adopted to facilitate reference is to commence with 
the outer whorl of the calyx and consider all its relations with the other whorls of 
the flower passing from the outside inwards; following this comes the inner whorl 
of the calyx and tts relations with the other constituents of the flower from the 
outside inwards and so on. 

The following table shows the characters studied and the correlation coefficients 
found. 

In order to’ make the comparison of the various correlations as complete as 
possible it will be necessary to consider each constituent or organ with all the 
other constituents or organs and to avoid overlapping as far as possible. The 
most natural method would be to commence either with the outermost consv-tuent, 
namely, the outer whorl of the calyx, or with the innermost constituent, namely, 
the inner whorl of the andreecium. For reasons of a morphological character, which 






will be seen later, the inner whorl of the andreecium has been chosen as the 
starting point. 
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tw 


from what was originally a single stamen in ancestral forms. 


TABLE 0. 


outer whorl of the calyx and the inner whorl of the calyx 

outer whorl of the calyx and the corolla si 

outer whorl of the calyx and the outer whorl of the a ndre cium 

outer whorl of the calyx and the inner whorl of the andreecium 

outer whorl of the calyx and the andreecium 

inner whorl of the calyx and the corolla met 

inner whorl of the calyx and the outer whorl of the ; andre sclum 

inner whorl of the calyx and the inner whorl of the andrecium 

inner whorl of the calyx and the andreecium 

calyx and the corolla on - 7 

calyx and the outer whorl of the andrecium 

calyx and the inner whorl of the andreecium 

calyx and the andreecium ... 

corolla and the outer whorl of the ‘androscium 

corolla and the inner whorl of the andreecium ... 

corolla and the andrceecium 

outer whorl of the andreecium and the i inner whorl of the andree cium ‘ 
inner whorl of the andrecium, position 3.4 and the inner whorl of the 
andreecium, position 5.6 ... ~~ _ Sue aie ae a sida 
outer whorl of the calyx and the inner whorl of the andreecium, position 3. 4 
outer whorl of the calyx and the inner whorl of the andrcecium, position 5. 6 
inner whorl of the calyx and the inner whorl of the andreecium, position 3. 4 
inner whorl] of the calyx and the inner whorl of the andreecium, position 5.6 
corolla and the inner whorl of the andreecium 3. 4 coe = : 

corolla and the inner whorl of the andreecium 5.6 ood 
outer whorl of the andrecium and the inner whorl of the androx sium, 
position 3.4 mee 
outer whorl of the 
position 5.6 


andreecium and the inner whorl of the androecium, 


(a) The inner whorl of the andrecium. 
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| XXVI 





Yr. 


*1957 
*7275 
“5886 
*2613 
"4371 
*2476 
*3229 
*B905 
"4592 
“6926 
"6245 
“4014 
"5721 
"4762 
1773 
*3174 
“1984 
*4305 


*4646 
2134 
"4634 


2519 


*2558 
‘0903 
‘2661 


*1539 


From the standpoint of the systematic botanist the most anomalous constituent 
the cruciferous flower is the inner whorl of the andrcecium, inasmuch as in each 


the positions where one stamen would naturally be expected, the presence of 


© is regarded as typical. 


O. W. Ca. = Outer whorl of the calyx. 
I. W. Ca. = Inner whorl of the calyx. 


Ca. = Calyx. 


Co. = Corolla. 

O.W. A. = Outer whorl of the andrceecium. 
I.W. A. =Inner whorl of the andreecium. 
A. = Andrecium. 





It has been explained in a previous section that 
botanists now usually regard this anomaly as having arisen by collateral chorisis 
For the sake of 
conciseness and in order to avoid unnecessary repetition the following abbreviations 
have been used in Tables P—X. 


Correlation Coefficients between the Number of Various Organs and their Constituents. 
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The following Table, P, gives the correlation coefficients between the I. W. A. 
& and the other constituents or organs of the flower in order of position. 
TABLE P. 
I. W. A. and the other Constituents. 


> l 
i Constituent or Organ Table Correlation 
is O.. Wo Ca. .:. = IV 2613 
Ee rw. C8. ... con: ats 3905 
| Ca. iti ao > Coe 4014 
Co. ats = XV ‘1773 
x CO: WA x. ee XVII "1984 


The highest correlation between the inner whorl of the androecium and the 
other constituents or organs is that with the calyx; next in order come the inner 
whorl of the calyx, the outer whorl of the calyx, the outer whorl of the androecium, 
and lastly the corolla. In other words, we should be better able to predict the 
number of stamens in the inner whorl of the andreecium from the number of 
members in the calyx than from the number of members in any other constituent 
or organ. 


(b) Relations between the organs themselves. 


Having thus discussed the inner whorl of the andreecium with the other organs 
and constituents it migit lead to some useful result if we proceed to determine 


’ 


the “organic correlation” existing between the various organs themselves. In 
this connection we have to consider the calyx, the corolla and the andreecium, and 
for this purpose the correlation Tables X, XIII and XVI have been prepared. 
The character which has been selected for this study is the number of members 
in each organ. 
The following Table (Q) shows the results obtained : 
TABLE Q. 


Correlation Coefficients between 


Oa: dnd Ons» "6926 
Os.and A. ... 5721 
Co:and‘A. ..; 3174 


SES 


(1) The calyx and corolla are much more highly correlated to one another thau 
is either of these with the andrecium. In other words, the two protective organs 
of the perianth are more highly correlated to one another than is either protective 
organ with the male reproductive organ. It is further evident that (2) the calyx 
is much more highly correlated to both the corolla and the andreecium than are the 
two last named to one another. From (1)it may be concluded that, on an average, 


32—2 
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an increase or decrease from the accepted typical number, namely four, of petals in 
the corolla is accompanied by an increase or decrease in the number of sepals 
in the calyx ; while from (2) an increase or decrease in the number of stamens in 
the andreecium will be accompanied, on an average, by a greater increase or decrease 
in the number of sepals than in the number of petals. 


(c) Relations between the constituents of organs. 


The constituents of (1) the calyx and (2) the andreecium will now be con- 
sidered. 

(1) Calyx. The outer and inner whorls of this organ are inserted at different 
levels and have a decussate arrangement, so that, although the organ as a whole 
is protective in function, the two whorls actually help to enclose the flower at 
right angles to one another. The correlation between these two whorls is an 
extremely low one, namely, ‘1957 (Table I), in other words, an increase or decrease 
in the number of sepals in either of the whorls of the calyx is associated only in 
a very small degree with an increase or decrease in the number of sepals in the 
other whorl. Or again it may be expressed thus, the two whorls of the calyx 
vary to a great extent independently of one another. This statement should be 
taken in conjunction with that made on p. 244 with regard to their Means and 
Variabilities and should also be borne in mind when the correlation between 
these two constituents and the other parts of the flower are discussed below 
(see Tables R and S). 

(2) Andracium. This organ is also composed of two whorls, an outer and 
an inner inserted at different levels. Its function is of course reproductive. 
The correlation between the two constituents is very low, namely, "1984 (see 
Table XVII), and is almost the same as that between the two whorls of the calyx. 
The inner whorl of the andrcecium shows greater variability than the outer whorl 
and tends to vary independently of this latter constituent, just as in the case of 
the two whorls of the calyx. 

Having thus considered the organs per se, let us now compare the correlations 
between each individual constituent or organ and all the other constituents or 


TABLE R. 


(d) Correlation Coefficients between the Outer Whorl 
of the Calyx and 


2nd Component Table ¥. 

Ss A ae I "1957 

RAB. “se or [I °7275 

oe. eee [I] “5886 

Be Wes. [V "2613 
Vv 


A. ‘4371 














% 


1 


fd 
® 
4 












































J. J. Srmmpson 249 


organs. For this purpose it will be necessary to tabulate the results im series and 
re consequently it might be well to start with the outermost constituent of the flower, 
namely, the outer whorl of the calyx, and tabulate the correlation coefficients 
passing inwards to the andreecium. The inner whorl of the calyx will next be 
taken in relation to the other constituents and so on. 


& 


From the above table it will be seen that the outer whorl of the calyx is most 


a highly correlated with the corolla; it is also highly correlated with the outer whorl 
cd 5 J ? 5 y 
4 of the andreecium but much less so with the inner whorl of the andreecium. 
~| 
bd TABLE S. 
(e) Correlation Coefficients between the Inner Whorl 
of the Calyx and 
a a —— ——— 
. | 
z 2nd Component Table | Yr. 
a ee VI 2476 
WA. Vil *3229 
Rs Ws ee yer Vill 3905 
ae oe eos [X *4592 


The low correlation between the inner whorl of the calyx and the corolla is due 
to the close adherence of the former to type, that is, there is very small variability. 


TABLE T. 


(f) Correlation Coefficients between the Calyx and 
2nd Component Table r. 
eee Xx "6926 
Ae ER XI 6245 
Se ee XII 4014 
Be Ae : XIII ‘5721 


There is a higher degree of correlation between the two organs of the perianth 
than between the calyx and the andrecium. The high correlation between the 
calyx and the outer whorl of the andrecium is mainly due to the high value 
obtained for the correlation between the outer whorl of the calyx and the outer 
whorl of the andreecium. 

TABLE U. 
(q) Correlation Coefficients between the Corolla and 


2nd Component Table r. 


i ae as XIV “4 
Sy Se Seer XV l 
A. i 
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The corolla is much more highly correlated with the outer whorl than with 
the inner whorl of the andreecium, and the correlation between the corolla and the 
androecium as a whole is not very great. 


A comparison of Tables T and U shows that there is a much greater correlation 
between the calyx and the andreecium and its two whorls, than between the corolla 
and the same constituents. 


So far we have considered the relationships between the different parts of the 
flower from the outside inwards, but when we examine these relationships, taking 
the inner whorls as our starting point, some new aspects of the problem become 
manifest and, as these have been of great value in the interpretation of the results, 
it has been considered advisable to tabulate them thus: 


TABLE V. 


(h) Correlation Coefficients between the Inner Whorl 
of the Andrecium and 


2nd Component Table | r 


+ ae eu XII | *4014 
>) ee Vill *3905 
GW, Oe. ... IV *2613 
a XVII "1984 
a ete XV 1773 


TABLE W. 


(2) Correlation Coefficients between the Outer Whorl 


of the Andrecium and 


2nd Component Table Fr. 

> See XI | -6245 
Wh Wie OR voce II] | *5886 
OO. ss a XIV 4762 
Be Wetted Vil | °3229 | 


A comparison of Tables V and W shows that the correlations between the 
outer whorl of the andrcecium and the other components are higher than for the 
inner whorl of the andrecium, except in the case of the inner whorl of the 
calyx. 
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TABLE X. 


()) Correlation Coefficients between the 
Andrecium and 


2nd Component Table e. 

J em XIII ‘5721 
= a eee IX "4592 
he eS Re V °4371 
ERK. = XVI “3174 


This table shows that when the andreecium is considered as a whole it is most 
highly correlated with the calyx and least correlated with the corolla. 


V. MORPHOLOGICAL SIGNIFICANCE OF THE STATISTICAL RESULTS. 

It is quite clear from the tabulated results that there is a definite departure 
from the usually accepted cruciferous structure in a very large number of the 
flowers of Lepidium Draba which have been examined for this study. This does 
not obtain merely in any one organ or constituent but in all the organs and 
constituents, although not to the same degree in each. 


The statistical results will now be examined from the standpoint of the 
botanist in order (a) to note their morphological or genetic significance and 
(b) in order to see whether these figures throw any light on the evolution of this 
cruciferous plant. 


It is almost axiomatic to state that the “purpose” of a flower is a purely 
reproductive one and that therefore its existence is justified only in so far as it 
serves to reproduce its kind. But not all the parts of a flower are solely repro- 
ductive in function. Each individual consists of two parts, (1) Reproductive, 
(2) Protective. (1) The reproductive organs are the gynecium (2) and the 
andreecium ('), while (2) The protective organs (perianth) are the corolla and 
the calyx. 


One oi the organs of the perianth, namely the corolla, is still further specialised. 
The calyx consists of four sepals, green in colour, whose sole function is to protect 
the flower when in the bud, and in many cases these are reflexed immediately after 
the flower has opened up, and are of no further importance to it. On the other 
hand the petals though essentially sepal-like in structure, in this as in the great 
majority of flowers, are not green but of some other colour. In the species under 
consideration they are white. Now although the petals are of great importance 
in protecting the reproductive organs while in the bud their utility does not cease 
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when the flower opens but, along with small nectaries at the base of the stamens, 
serve as an attraction for insects whose visits are essential for cross-fertilisation. 


The reproductive organs of what is regarded as the typical cruciferous flower 
consist of (1) the gynzecium which is composed of two carpels and (2) the andreecium 
which is composed of six stamens. The stamens are delicate structures and do not 
hold an isolated position in the flower. When in the bud and immature they are 
subject to external influences, for example, (1) they might be shrivelled up by the 
heat of the sun, (2) they might be blasted by rain or wind or (3) they might be 
attacked by herbivorous insects, so that the protective perianth plays an important 
part in flower economics. Now what does an increase in the number of stamens 
imply? It is obvious that if the number of stamens is increased the total volume 
occupied by the reproductive organs is increased and consequently a tax is put 
upon the protective organs if they are to fulfil their function adequately. If the 
perianth does not respond to this tax from space considerations, the reproductive 
organs stand a small chance of ever fulfilling their function, so that one would 
naturally expect that variation of some kind in the perianth would follow variation 
in the reproductive organs. 


Another important point which must never be lost sight of when interpreting 
the statistical results is the symmetry of the cruciferous flower. The calyx consists 
of two whorls each with two sepals; the corolla of one whorl of four petals and the 
androecium of two whorls of stamens, the outer having two members and the inner 
four members (see Plate I, fig. 7). Consequently a cruciferous flower is bilaterally 
symmetrical only on that vertical plane which passes through the division wall of 
the carpels, between each of the pairs of stamens in the inner whorl, between two 
petals on either side and through the middle of the outer pair of sepals. This 
plane may be referred to as the “plane of symmetrical division.” Owing to the 
fact that the corolla consists of only one whorl, the outer whorl of the calyx 
corresponds in position to the inner whorl of the andrcecium, and the inner whorl 
of the calyx to the outer whorl of the andreecium. 


From a study of the Means and Standard Deviations of the various organs and 
constituents we arrive at the foliowing conclusions : 

Calyx. (1) The greatest approach to constancy in number in the whole 
flower is in the inner whorl of the calyx. 

(2) There is a significant difference between the means of the two whorls. 

(3) There is much greater variability in the outer than in the inner whorl of 
the calyx and on an average it possesses fewer sepals. 

(4) There is a tendency towards a reduction from type in the number of 


sepals in the calyx. 


Corolla. (5) There is a tendency towards a reduction from the accepted 


typical number in the number of petals in the corolla. 
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Andrecium. (6) There is a significant difference between the means of the 
distributions of 


(a) the members of the two whorls of the andreecium, 
(b) the members of the two positions in the inner whorl, 
and (c) the members of the two positions in the outer whorl. 


(7) From whatever axis we view the andreecium as an organ it is distinctly 
asymmetrical in the distribution of its functioning stamens. 


(8) There is much greater variability in the inner whorl than in the outer 
whorl of the andrcecium. 


(9) In both positions in the inner whorl of the andreecium there is a tendency 
towards a reduction from the accepted typical number of stamens and in the 
position where this is most marked there is the greatest variability. 

The interpretation of these results is not at first sight very evident. 


Why should there be a tendency towards a reduction in the number of 
members in the different organs of the flower and why should this tendency 
be most marked in the inner whorl of the andreecium? As has already been 
pointed out all the flowers examined were taken from a single plant which gave 
rise to new stems by means of buds on the roots. May this tendency to reduction 
in the parts of the flower whose function is sexual reproduction not be an expression 
of a tendency towards an elimination of sexual in favour of vegetative reproduction ? 
Another phenomenon which lends support to this hypothesis is the fact that in 
this plant the percentage of “pods” which attain maturity is extremely small. 

Whether there is or is not a tendency towards vegetative reproduction, may 
we not also have here a harking back towards an ancestral form in which the 
number was less than the at present accepted typical number? In fact one would 
expect that if the present constitution of the inner whorl of the androecium had 
been most recent in development, reversion would first take place in it, and 
conversely one might reasonably conclude that since this whorl shows greatest 
variability, and most marked tendency to reduction in the number of members, it 
is more than probable that its present constitution was arrived at by an increase in 
number from a more primitive type. 


Let us now examine the deductions made from a study of the correlation 
coefficients and see if they have any morphological interpretation. 

(1) The calyx and corolla are more highly correlated with one another than is 
either of these with the andreecium. 


(2) The calyx is more highly correlated with the andreecium than is the corolla. 
In other words, the two protective parts are more intimately associated in increase 
or decrease with one another than is either of these with the male reproductive 
organs, and further the calyx which is solely protective in function is more 
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intimately correlated with the male reproductive organs than is the corolla which 
serves as an attraction for insects as well as a protective covering of the bud. 


(3) The two whorls of the calyx are not highly correlated, ic. they vary inde- 
pendently of one another. 


(4) The two whorls of the andreecium also are not highly correlated. Morpho- 
logically this means that when there are two constituents in one organ, each having 
the same function, they may vary independently of one another, so that although 
an increase or decrease in the number in either may be correlated with an increase 
or decrease in the number in any other constituent of the flower, the same does 
not hold true with regard to the two constituents. 


(5) The outer whorl of the calyx is most highly correlated with the corolla, 
next with the outer whorl of the andresciura and lastly with the inner whorl of 
the andreecium. The reason why the outer whorl of the calyx is more highly 
correlated with the outer whorl than with the inner whorl of the andreecium is 
not at first sight very evident, but may be explained on the basis of its protective 
power. The members of the outer whorl of the androecium lie in a plane parallel 
to that of the outer whorl of the calyx, and are much more widely separated in 
this plane than are the members of the inner whorl of the andreecium. Con- 
sequently any increase in the number of stamens in the outer whorl would involve 
a much greater increase in volume within the flower than a corresponding increase 
in the number of stamens in the inner whorl. Thus we are not surprised to find 
that such an increase in the outer whorl of the andreecium is more intimately 
associated with an increase in the outer whorl of the calyx than a corresponding 
increase in the inner whorl of the androecium would be. 


(6) There is very low variability in the inner whorl of the calyx and it is 
\ J 

almost equally correlated to the two whorls of the andreecium. The morphological 
explanation of these facts follows as a corollary to that given above. 


(7) The calyx is much more highly correlated with the andrcecium as a whole 
and with its two whorls than is the corolla. 


As we have already said the calyx is the predominantly protective organ and 
consequently this higher correlation has a physical basis. The corolla being partly 
attractive does not enter so closely into space economics. 


(8) The outer whorl of the andreecium is more highly correlated with the 
other components of the flower than is the inner whorl of the andrecium. ‘This 
again follows on the basis of space considerations. Any increase in the number 
of members in the inner whorl of the andreecium does not involve so radical a 
change in the volume of the flower as does a corresponding increase in the outer 
whorl of the andreecium. 
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VI. VARIATION IN THE GYNACIUM. 


So far we have not considered the gynzcium on account of the small number 
of variations which occur in that organ and from the fact that these do not lend 
themselves to statistical treatment. 


The gynecium consists typically of two carpels which are flattened in a vertical 
plane parallel to those containing the pairs of stamens in the inner whorl of the 
andreecium. The thin partition wall separating the two carpels therefore stands 
at night angles to this plane. 

Now when we examine the different types of variations in the structure and 
number of the carpels we find the following: (1) a single carpel, (2) two carpels 
typical), (3) three carpels, (4) four carpels, (5) two sets of two carpels within 

y} » (9, pels, ( pels, (9, 
a single perianth, (6) two sets of two carpels within separate perianths but on one 
pedicel. 

Let us now proceed to examine each of these in some detail. 

(1) The gynecium consists of a single carpel (see Figs. LXXXVII—XCII). 

In all these cases, except LXX XVII, as will be at once seen by reference to the 
figures, the suppression of a carpel is accompanied by the suppression of some of 
the members of nearly all the other organs thus: 

In LX XXVIII two petals are absent and one stamen is aborted. 

In LXXXIX one sepal, two petals and two stamens are absent. 

In XC, XCI and XCII all the organs are deficient in members. 


A noteworthy phenomenon in this respect also is that the suppression of 
members which accompanies the suppression of a carpel is usually in the vertical 
plane which passes through the plane of separation of the carpels. 


(2) The gyneciuin consists of two carpels. 


This is the accepted typical structure and the statistical study deals with these 
in detail. 


(3) The gynecium consists of three carpels (see Figs. XCIII and CII). 

When three carpels occur in the gynecium they are never found co-laterally, 
i.e. the additional carpel is never found with its origin at the side of a carpel, but 
always arising from the plane of separation, which is in the plane of greatest 
variability. 

(4) The gynecium consists of four carpels (see Fig. Cl). 

Just as in the previous case the increase in the number of carpels takes place 
in the plane of separation of the carpels—one on either side, so that a cruciate 
structure is found. A reference to Fig. CI will make this clear. In both of these 
groups it will be evident that an increase in the female reproductive organs is 
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associated not only with an increase in the male reproductive organs but also in 
an increase in the protective organs or perianth. 


(5) The gynecium consists of two sets of carpels within a single perianth. 


This is rather an anomalous group but is extremely interesting inasmuch as it 
contains a series of annectant forms linking group 2 to group 6. What we actually 
have here is a complete reduplication of the reproductive organs encased within 
a single series of protective organs. In some of the flowers examined with this 
structure it was rather difficult to determine the orientation owing to a torsion 
of the thalamus, but in the types figured on Plates IX and X (Fig. XCIV and 
Figs. XCV et seq.) the mode of origin of these is quite evident. Several 
important observations on these forms may be stated. 

(a) There are really two complete sets of reproductive organs and in one case 
(see Fig. XCVII) each of these is of the typical cruciferous structure. 


b) Increase in the number of the reproductive organs is accompanied by an 
5 
increase in the number of members in the protective organs. 


(c) The increase in the number of members of the reproductive organs is for 
the most part in the plane of division of the carpels, in other words, in the outer 
whorl of the calyx and its associated petals. 


(d) This is also the plane along which the separation of the reproductive 
organs has taken place. 


(e) This plane is the one which we have already shown in the statistical part 
to he the plane of greatest variability. 

(6) The gynecium consists of two sets of two carpels within separate perianths 
but on one pedicel. 


In this group we reach the limit of variability in the material examined. In 
place of a single flower consisting of calyx, corolla, andreecium and gynecium we 
actually find two complete sets of all these organs, on one pedicel (see Figs. CIII— 
CVIIT), while in one case (Fig. CIII) each of the two flowers has the typical 
cruciferous structure, so that were each of these separately examined it would 
undoubtedly be regarded as a normal flower. Yet we must bear in mind that, 
botanically considered, one flower and one flower only arises from a pedicel. Were 
this, therefore, an isolated example, and if no annectant forms existed, the departure 
might well be regarded as a “mutation,” but a consideration of the numerous 
variations which we have already considered, taken in conjunction with group 5, 
only serves to emphasise the fact that “the vertical plane which passes through 
the partition wall of the two carpels and consequently separates the individuals 
of the pairs of stamens in the inner whorl and passes through the centres of the 
sepals of the outer whorl of the calyx is a plane along which this flower is in a 
state of flux and is the plane in which it is probable that the flower has changed, 
and is still changing, from some quite different ancestral form.” 
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VII. SuGGeEstions ror FururRE STUDIES IN THIS PLANT. 


It must be very obvious to anyéne who has perused this paper that the results 
which might be obtained from a study of this plant are by no means exhausted. 
An attempt, however, has been made to interpret the variability in its flowers, 
both from a morphological and an evolutionary standpoint. Studies of a different 
nature might be undertaken in order to test the results obtained, e.g. : 


(1) What is the degree of fertility in the flowers of this plant? For this 
purpose it would be necessary to find the percentage of flowers which produce 
fertile seed. 


(2) What are the variants, if any, which are associated with infertility ? 


(3) What are the characters of the flowers which are produced from the seeds 
of the different variants? If seeds selected from the different variants were grown 
separately and self-fertilised, one could trace the variations in the flowers of the 
next generation and see to what extent the different variations were transmitted. 
This study is capable of much elaboration and is one which would be fraught with 
great possibilities. It seems to involve a satisfactory method of determining how 
far these variations are concerned in plant economics, and also to what extent 
they have been instrumental in the evolution of the Order Cruciferz. 


EXPLANATION OF FIGURES 8, 9 AND 10. PLATE lI. 


FIGURE 8. 
(a) Typical stamen (outer whorl). 
(b) Stamen with half-length filament and complete anther (outer whorl). 
(c) Typical stamen (inner whorl). 
d) Non-chorised stamen with two complete anthers (inner whorl). 
e) Stamen of inner whorl with two complete anthers but only chorised in the upper half. 


FIGURE 9. 


(a) Aborted stamen of outer whorl, i.e. filament with no anther. 

(b) Absence of stamen in outer whorl. 

(c) Full-length filament in inner whorl with no anther. 

(ad) Half-length filament in inner whorl with complete anther. 

(e) Half-length filament in inner whorl with no anther. 

(f) Non-chorised stamen in inner whorl with half-length filament but with two complete anthers. 


FIGURE 10. 

(a) Normal sepal. 

(b) Sepal divided almost to the very base. 

(c) Sepal completely divided into two distinct sepals. 
(d) Sepal absent. 

(e) Normal petal. 

(f) Petal divided almost to the very base. 

(g) Aborted petal. 

Petal absent. 






































Outer Whor!] of 


Andrcecium. 
Number of Stamens. 


Inner Whorl of Calyx. 





5 
2 
— 
ood 
<q 


Pc} 
69) 
x 
= 
3 
rs 
A 


Number of Sepals. 





TABLE I. 
Outer Whorl of Calyx. 
Number of Sepals. 
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0 1 | 2 |s 4 {Totals 
0 _ 1 — 1 
1 2 “S eae 3 
g 2 48 1748 | 3 2 1803 
3 “ ” ee ee 0 
l = 8; 3 6 
Totals} 2 | 50 | 1753 | 6 2 1813 
TABLE III. 
Outer Whorl of Calyx. 
Number of Sepals. 
0 1 D 3 4 {Totals 
| 
| 0 -— — 5 ~ — ) 
ee 2 | 42 40 | 1 = 85 
| 2 Se 6: Se ee ey 
a = “Ss tier 3 
Fe a 2/3 | 2 7 
Totals] 2 | 50 1753 | 6 | 2 | 1813 





TABLE V. 
Outer Whorl of Calyx. 


Nuinber of Sepals. 








0 1 4 4 Totals| 
4 8 30 38 
5 2 10 325 367 
6 _ 2 | 1365 — | 1367 
7 — 19 19 
8 12 l 13 
9 - l ] 
10 l l _ 2 
12 l l 2 
13 l l 
14 l l 
18 _- 2 2 





Totals 
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TABLE II. 
Outer Whorl of Calyx. 
Number of Sepals. 


Totals 











| 0 ] 2 > 4 
qa|of—| 3} t}-] 4 
=| 1 fiw: 23> 3 
isi 2 — | 12 ¢\— += 18 
r 3 34 Stee 42 
| £ 3] =) es ~ | 1734 
Sl. £ Bu ]-t Tees s 
s| @¢.4—1|—} 8} £424 4 
7, 

Totals} 2 | 50 | 1753| 6 | 2 | 1813 





TABLE IV. 
Outer Whorl of Calyx. 
Number of Sepals. 





























0; 1 ? 3 | 4 | Totals 
=e 1 _ 1 | _- | —|— 1 
@ a eS 6 me) — hes 30 
Ba ee | 2) Se fT | ae 
5s 4 2 | 39 | 1399 | — | — | 1440 
y, 2 5 2 St. kL 28 
bide 6 -}—| 12});/—|]— 12 
5 8 8 Bia ae 4 
= g 9 ft Sil — 1 1 
a i 10 —i|i—| l1j|j— 1 
SA) I a a Oe Nx, 2 
~  |totals! 2 | 5011753! 6 | 2 | 1813 
TABLE VI. 
Inner Whorl of Calyx. 
Number of Sepals. 
0 1 2 | 4 4 {Totals 
73 ! Y 
?S 0 Y ] 1 1 
2 1 3 ~ 3 
fe < 2 1|— 17 18 
c 3 - l 4l 12 
2 9 4 I7S4. | = — 5 1784 
7 5 6 2 8 
= 6 — q y 
3 l 3 4 
mt | 
Totals} 1 3 1803 | O 6 1813 
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TABLE VII. 
Inner Whorl of Calyx. 
Number of Sepals. 


0 1 


“sae ter 
Paty 2 4 


J. 


J. 


Totals 
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5 1 
2 D 
a > 
co Vv 
= j 
< ? 


we bo 





Number of Stamens. 


Totals 





TABLE 


Ets | 1803 


IX. 


Inner Whorl of Calyx. 
Number of Sepals. 





1813 | 

















0 | 1 | 2 3 | 4 | Totals, 
| 
} = | | 
: 4 -- 38 | — | 38 
Z 5 l Ae 2 367 
z 6 - | 1367 1367 a 
F 7 - 19 | — | 19 3 
= 8 = 11 | 2 13 s 
a 9 — l | 7 
es 10 -|— l a 2 
ie, 2. ees 2 
dg; 18 J—'!- 1 | — l 
= | 1 -|- Lj —] 1 | 
18 —_|— - | 2 2 | 
Totals} 1 3 | 1803 | O | 6 | 1813 
TABLE XI. 
Calyx. 
Number of Sepals. 
Z 2 3 j ls |e | 7 |Totals 
= 0 1 | - | 5 
DM 1 4 t+] 39 l _- 85 
a 2 — | 7} 1704 ] l 1713 
8 - l l l 3 
4 “i Be ae ae 7 





Number « 








Number of Petals. 


Inner Whor!l of Andreecium. 


SIMPSON 


TABLE VIII. 
Inner Whorl of Calyx. 
Number of Sepals. 









Totals 

















Z 
5 4 ‘ 
3 3 294 294 
s 4 — 3 1437 — | 1440 
“i 5 l 26 I 28 
Aes = 1 1 12 
Ff > & 2 2 4 
aes l 
Z| 10 |— Lisa 1 
Ope) ae Pace 2 
| | 
Totals] 1 | 3 | 1803| 0 | 6 | 1813 
| 
TABLE X. 
Calyx. 
Number of Sepals. 
2 3 4 5 6 | Y [Totals| 
0 2 l l 4 
1 2 l 3 
2 ] 12 a) 18 
3 35 2 2 
| 1734 1734 
7) ] 1 3 l 2 8 
} 3 ] 








Inner Whor!l of Andreeci 


Number of Stamens. 

















5 | 49 | 1748 | 31 5. | B Pass 
TABLE XII. 
Calyx. 
Number of Sepals. 
2 3 4 5 6 7 {Totals 
1 ye re 1 | 
2 6 24 - - 30 
3 2 292 294 
| 1 | 38 | 1398 1440 | 
5 l y oe fe eg Pe Oy 28 | 
6 ll 1 | — 12 | 
8 l 2 l 4 | 
9 l 1 | 
10 | - —|1 - 1 
ee ew Sa aN Sy Ie 2 





Totals 


























Inner Whorl of Androecium. 


Andrcecium. 
Number of Stamens. 





Number of Stamens. 


TABLE XIII. 
Calyx. 


Number of Sepals. 








| ] | 
Z) Sob AAS 6 7 {Totals 
| | | Cat 
| S 
4 . 8 30 | - = 38 | = 
| 6 5 | 39 | 323 367 | 
6 2 | 1365 Beas 1367 | = 
ae _ 19 = 1D |. es 
8 = 10 1 2 - 13 | + 
9 l <= l 5 
10 - ~ l - 1 2 | 
2 _ 7 2 2 | 
a at 1 | 
i i4— l l 
18 ~ _~ 2 2 
| 
Totals} 5 49 | 1748 | 3 5 3 | 1813 





TABLE XV. 
Corolla. 
Number of Petals. 








0 1 2 3 Y 7) 6 {Totals 
1 l l 
2 6 24 30 
3 i—j|—| 3| 3/| 20 —| 294 
4 13 |3 1 9| 37, 1388 1440 
5 l l 2 22 | 2 28 
6 -- 11 l 12 


9 
10 
14 


noe = 





Inner. Whorl of Andreecium. 


Totals} 4 





Outer Whorl of 


TABLE XVII. 


Number of Stamens. 


P Andrcecium. 











0 1 D , Totals) 
P 1 l ] 
5 2 30 30 
g 3 5 | 289 294 
s 4 2 75 | 1363 1440 
a 5 3 4 19 | 2 28 
9 6 11 l 12 
5 8 1 l 2 { 
"= 9 “ l ] 
R:, 10 _ Gi l 
A | 14 —1% 2 
|Totals] 5 85 | 1713 | 3 7 1813 
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TABLE 


Corolla. 


XIV. 


Number of Petals. 


Totals | 








Totals 





TABLE 
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Corolla. 


Number of Petals. 





4 O-b2 D 3 4 5 | 6 | 
=| | 

2 0 hice Pte oe a 5 | 
ND 1 3 3 7 | 35 7  fopte 85 
~ Q — i! £') 30 | St. | 2 is 
he 3 Rafe te ao ee at 3 
S| 4 I- ire is 7 
7, 





4 | 

ge ct See Se i _| 38 

S 5 3 2 8 36 317 - 367 
ae 6 i 3 4 | 1359 - 1367 
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2s = = 10 l 13 
Ss| 9 = 
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eS 12 2 2 
Bl £8 - - - l 
Zz U : => ie 
18 — 2 











Totals 4 3 


18 | 42 





TABLE 


1734 


XVIII. 


Inner Whorl of Andreecium, position 3. 


Number of Stamens. 











1 4 3 h 5 6 7 {Totals 
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& 0 l 1 
a 1 30 287 317 
s 2 1} tae! @ ty = 1446 
ogi te { 26 | 6 eal Or 37 
S 4 5 2 7 
5 5 . l l | — 3 
os 6 0 
s 7 - - 2 2 | 
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Totals} 42 1754 9 4 3 0 2 
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TABLE XIX. TABLE XX. 
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. 2 Ree ex ye ree en = d ae) gee 5 
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3 2 [ g Z ' ) 

&,8)} 1 j—| s|) 4] — 42 &.3| o f—| 1] — es et 1 
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- set toe se Cee te 9 “4ion| 2 | 2 | 40] 1404] —| —] 1446 
a S8u| 4 ]—|-—| 2}/-}2 4| Say|/ 3 J—| 2] 3a3/2|/-—] 37 
e eee.| > |-|-| —|2!/-] 2] weBs] 47-|-| 6]/-]1 7 
fe Sae| ¢@ §—|- — |} = 0; 3224) & —}—| — |2]1 3 
L. eS) 7 f-1|-| —/2)- S| PS! eet] = Pee 2 

5 , | ' | | D ZA | | | 
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TABLE XXI. TABLE XXII. 
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Number of Sepals. Number of Sepals. 
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3 ¢ ] l 5 og 
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40e| 3 41|— 7 | 9| 3:-8/ 2 |1/ 3 | 1442 1446 | 
wae| 4 |—|- “? eae 4 wool f 2-1) Sie 7 | 
wain| & A aie deg 2 4 ee ae "Y peas = 7 
sSe@Z) 6 }—|—| — | so 0 os) 6 a) — aes ee 3 
= g 5 moo 7 iy pa Fa ee eae 
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Number of Petals. 
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Outer Whorl of Andreecium. 


Number of Stamens. 
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Outer Whorl of Andrecium. 


Number of Stamens. 
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NOCHMALS UBER “THE ELIMINATION OF SPURIOUS 
CORRELATION DUE TO POSITION IN TIME OR SPACE.” 


Von O. ANDERSON, St. Petersburg, RuBland. 


1. Im Aprilheft der Biometrika, hat “Student” gezeigt*, daB das von Cave und 
Hooker vorgeschlagene Verfahren, den Korrelationskoeffizienten zweier oscillieren- 
der Variablen durch Berechnung erster Differenzen (also durch Ersetzung der 
Reihe 2,, #,... 2, durch die Reihe: A’a, =a, — a, A’a,= @, — a3,... A’@p.=2n+—@n) 
vom evolutorischen Element zu befreien, eine Verallgemeinerung zuléiBt. Das 
Verfahren ist nimlich, streng genommen, nur dann richtig, wenn die evolu- 
torische Komponente durch eine lineare Gleichung darstellbar ist. Findet 
letzteres nicht statt, kann also, z. B., jener nur eine parabolische Gleichung 
hdherer Ordnung geniigen, so mu8 man zweite, dritte u.s.w. Differenzen nehmen 
(also statt A’a,, A’z,... nehme man A’z, = A’a,— A’a,, A” x, = A’a, — A’a, ... etc.) 
und danach Korrelationskoeffizienten berechnen. Letztere kénnen bald einen 
konstanten Grenzwert erreichen, der das gewiinschte Resultat darstellt. 


Unterzeichneter ist schon vor etwa 2 Jahren zu ahnlichen Schliissen gekommen. 
Durch von ihm unabhangige Griinde wurde er aber bis jetzt vom Drucke seiner 
diesbeziiglichen Schrift abgehalten. Da er bei seiner Untersuchung Wege ein- 
schlagt, die von denen des “Students” sehr verschieden sind, und auch zu manchen 
Schliissen kommt, welche letzterem unbekannt geblieben zu sein scheinen, so 
kénnte vielleicht eine kurzgefaBte Darstellung der wichtigsten Resultate seiner 
Untersuchung fiir die Leser der Biometrika von einigem Interesse sein. 


2. Methode. Die englische statistische Schule vernachlassigt in ihren Unter- 
suchungen ein Verfahren, das von russischen und deutschen Gelehrten oft ange- 
wandt wird (Tchebycheff, Markoff, v. Bortkiewicz, u.s.w.) und neben grofer Strenge 
und Exaktheit noch den Vorzug hat recht elementar zu sein—die Methode der 
mathematischen Erwartung nimlich. Mathematische Erwartung einer GroBe (A) 
heiBt bekanntlich soviel als das Produkt aus dieser GroéBe und ihrer Wahrschein- 
lichkeit (w), also Aw. Wenn eine Variable eine Reihe einander ausschlieBender 


* Biometrika, Vol. x. Part 1, S. 179, ‘‘ The Elimination of Spurious Correlation due to Position in 
Time or Space.’’ By ‘‘Student.” 
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GréBen annehmen kann, so ist deren math. Erwartung als die Summe der Erwar- 
tungen aller dieser Groen definiert. Wir werden hier die mathem. Erwartung 
iiberall durch das Symbol H'( ) bezeichnen. F(A) ist also, z. B., gleich Aw. 


Die hauptsichlichsten Satze iiber mathematische Erwartungen diirften als 
bekannt angenommen werden. Um aber die Nachpriifung der Formeln dieser 
Schrift zu erleichtern, werden wir hier die fiir uns wichtigsten Siitze noch kurz 
andeuten : 


(1) E(@+y—zt+u-t...)=EH(e)+H(y)-H(2)+H£L(u)—-L£L.... 
(2) Wenn z, y, z,... von einander unabhingig sind, so ist 
E(a.y.z....)=H(@). EH (y).£(2).... 
(3) E(kx)=kE (x), wo k const. ist; und daher auch : 
E(k)=k. 


(4) Wenn eine Variable X die Werte 2,, #,...%, annehmen kann, so ist 
die Wahrscheinlichkeit W, daB die Differenz #;—H(#) zwischen den Grenzen 





—aVE(a*)—[E(a)? und +a VE (a 


wo a gréBer als 1 sein mu (ein Theorem von Tchebycheff). 


)—[£(a)f enthalten sei, gréBer als 1 -5, 


In unserer Untersuchung werden wir iiberall statt des wahrscheinlichsten 
Wertes einer GréSe deren mathematische Erwartung berechnen. 


Bestimmen wir zuerst, wie sich der Korrelationskoeffizient zweier oscillierender 
Reihen verhalt, wenn man deren GroBen durch Differenzen A’x, A”x, A’”’z..., 
A’y, A”y, A’’y, ... ersetzt, und darauf untersuchen wir die Frage von den Grenzen 
der Anwendbarkeit der verallgemeinerten Cave-Hookerschen Methode. Um Raum 
zu sparen, werden wir nur die endgiltigen Resultate der Berechnungen anfiihren, 
ausgenommen die 3 ersten Formeln, deren Bestimmung als Beispiel der Rech- 
nungsmethode dienen mige. 


3. Definition. Unter einer oscillatorischen Reihe werden wir eine solche 


Reihe 
By y Ley Byy very Viz ves, By 


verstehen, bei der 
E (a) = E (a) =... = E (a;) =... =E (an) = E (a) = const. 
und alle einzelnen Glieder von einander véllig unabhingig sind, so dab 
E («;,a;) = E (a;). E (a), wobei i+). 


Solchen Bedingungen wiirde zwm Beispiel eine Reihe geniigen, deren Glieder : 
die Resultate einer Versuchsreihe mit konstanter Wahrscheinlichkeit darstellen, 
etwa Resultate von Ziehungen aus einer Urne mit m weiBen und n schwarzen 
Kugeln. 





















QO. ANDERSON 


4. Mittleres Fehlerquadrat. 
Bezeichnen wir 2;— EH (a#) durch &;, so ist 
K(&;) = E [a;,— F («)| = E (a) — E (a) =0. 
Da die einzelnen & von einander véllig unabhiangig sind, so ist 
E (&:§) = E (&) . E(&) = 0. 


Das mittlere Fehlerquadrat der Reihe a ist gleich 


$ [a; — EH (x) 
1 
"5S ea 


Seine mathematische Erwartung wollen wir (nicht ganz in Ubereinstimmung 


mit der iiblichen Bezeichnung) o,” nennen. 


{3 [e; - E(@)}| . : 
o=E|'——[=7| 8 | Sa | 2 || Bo) +n[E(e)Pt 
1 1 
= H (a*) -—[E (a)f, 


ein Ausdruck, der oben im Satze 4 (§ 2) unter dem Zeichen der Quadratwurzel 


steht. 
n : 7 rn 5 
(S [a,— E (w)p| [S| 
Andererseits ist aber H | Saat ace gleich E | 2 “| und daher 
n * n 


n 
Untersuchen wir den Ausdruck EH Ee _ M,)), wo M, das arithmetische 
1 


n 
4 


a Li 


Mittel der Reihe z, also — bedeutet. Da 
? 


(a) +8, +E (a) + & +... + E(x) + & 


a; — M,= E (a) + &- : =&—M: 


i 
(wenn man M; fiir — einsetzt), so haben wir: 
n 


E EG - M,} | =E | 7 my =E | See é ne 
1 Lie 1 


[Eee—2x¢8)| 
= nH (&)—nE (Mf) =nk (&)-nk | — | = nH (&) -— E(&). 


E E (a; = | (n aa 1) E (&). 
1 


Um E£(£) zu bekommen, muf man diesen Ausdruck durch (n —1) dividieren. 


35—2 
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Es ist also auch 


Um das Fehlerquadrat o*y, fiir die erste Differenz A’x zu erhalten, beriick- 
sichtigen wir, daB 
Aa; = (x; a i441) = (&; ~ Ei4 1) 
und EK (A’a;) = FE (&;) — E (€i4:) = 0. 
Daher haben wir: 
n—1 Ss ree n <1 
S [A -E(Wa)P| [EE Fn 


E | a | — | 
Eiki +E | Zee] 


> 


1 7 7 ‘ 1 / 
| f(n—1) E(E?)-—04+ (n—1) E(E?)} = 2E(&). 


Es ist also One = 20-2 
Nach demselben Rechenschema ergiebt sich fiir das mittlere Fehlerquadrat 
der zweiten Differenz A”’x; der Ausdruck 60,2 
» adritten = A’'2; 20¢,7 


vierten 


k-ten 


” 


Wir kénnen daher folgende Gleichung aufstellen 
re O'a'x ee Oo A"z s. Ong o*,V), 
eee Se a 
“ ) 


welche exakt ist, und folgende 


n 2-1 n—2 

4 i * a * ao 
a(e—-M," 2Aaee 4A’? 
1 1 1 1 


=<. ~9(n—1) 6(n —2) es 20(n—3) 


welche nur annihernd richtig ist. 


n-k n-k 
> (k) 72 p- () x -- ] 2 
* De: : » 1 —s . 1 ig Ma") 
Es ist vorteilhafter — und nicht srt zu berechnen. 
. kl (n—k) 








k 
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ERRATUM 


Page 273 line 4 from bottom 


3 [x -£ (x)|[y%-£()] 
1 - 


= read — 
n—-1 n—1 


3 [2 -M,J[y.—- My] 
1 
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n 
LHiYyi 
5. Das mittlere Produkt 7 : 


Wenn zwei Reihen 


und Ti» Yar Yas--> Yous 
beide im Sinne des § 3 oscillatorisch sind, und eine Korrelation nur zwischen 
GréBen mit gleichen Indexen, also zwischen 2, und y, « und y., 2, und ys, .3.W. 
bestehen kann, so ist es leicht ersichtlich, daB 
E \[x; — E (x)| [yj — F(y)]} =0, wenn i+). 
Bezeichnen wir y;— E(y) durch w;, «;— H(#) wieder durch &€;, so finden wir 
leicht folgende Ausdriicke : 


[ey 
Pay = E | | = E (Ei), 


n 


=F (&; Wr; )= Pay, 


("s > Na; F Ay: 


=E| , 


("S'ame,A%y,| = 


Paws, A®y = dl | : —— a kk 
Wir kénnen daher wieder zwei Gleichungssysteme : 
P _Pxed'y _ Py 2A” * Pa “a Al'y _ Pavone atny mt _ Pata Aly =F (E; i), «. (2) 
” 3 6 ae 70 m 2k! 
kik! 


und 


n n—1 n-2 n-3 
E(x -E@)|[y- Ey) LA'adly, VAwA’y FA” HA"y 


m1 «(m= 1) ~  G—=2) ~~ 20(n—3) 


n-4 n—-k 
> Age; Aly; > A®x,A%y; 
1 1 9 
e007 = Pry, --.(2a) 
—— .(n—k 
ki ki 
aufstellen, von denen das erste exakt und das zweite angenihert ist, und welche 
den Gleichungen fiir o,? (also auch fiir 7,?) des § 4 genau analog sind. 
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6. Das Fehlerquadrat der Fehlerquadrate. 

Betrachten wir jetzt den Bereich der Schwankungen der GréBen der Systeme 
(1a) und (2a) um deren mathematisch zu erwartenden GréBen in (1) und(2). Mit 
anderen Worten, gehen wir (mit Riicksicht auf Satz 4 § 2) zur Darstellung der 
math. Erwartung der Fehlerquadrate der genannten GréBen iiber. 


3 [ a; - E (x) 





Fiir + aa ergiebt sich das Fehlerquadrat BE (&) a (OP 
= [a#:— Mz] 7 (eA) _ TR (En\8 » ae 
Fiir * ergiebt sich das Fehlerquadrat B (E) — [E (&)] + 2[# (2) ‘ 
n—1 . n n(n—1) 
n-1 
> [A’a; 
Fiir 2a 5) ergiebt sich das Febhlerquadrat 
(2n — 3) {H (&) —[H (&)P} + 2(n—- 1) (2 (&)P 
2(n—1) j 
"S [A”aiF 
Fir ergiebt sich das Fehlerquadrat 


6 (n — 2) 
(9n — 23) {E (&) — LE (&)F} + 17n — 42) [EF (&)/ 
9(n—2) ; 


n—k 
= (A® a)? 
Und endlich fiir 507 ergiebt sich der recht komplizierte Ausdruck : 
irk (n =< k) 
1 


_ 2 —_ OL\SH 4; fH 2\ 12) "(£2\\27T 4 2 — 9k 
Pree tee (n — 2k) (E (E*) —[E(&)P} + 4 LE (E)P LAY (n — 2k +1) 
+ A? (n—2kh+2)4+A?(n—2k4+3)+...4+ AP (n—2k+k)] 

+ 2B? (E (&) —[E()P) + 8(E(E)P. (Be + Bet... + By). 


Wenn J, b,, b.,... b, die Koeffizienten der Zerlegung des Binoms (1 +1) 


darstellen, also b,=1, b, =k, ieee. u.s.w., so ist hier 
. Sei} 
A? = (b? + b+ 62 + tee + b2P = lerai| ’ 


k-lk+1! 
Az =(bybz + Byby + «2. + Dy-aby)? = E shee) 


z b | . 
A, = (Db; + byby + Dads + « + hah} =| 7 


Aj? = (bybj + Dydjss + «.. + Oe gh)? = E el 

















Nie ” LT TIN 











Be = (6,7)? + (b, + b?? + (b,? + 
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be + bey +... + (b2+624+b2 4+... 


By? = (bob,)? + (bd, + bb.) + (by b, + bb, + bed; + ... 


+ (bb, + b,b, + bobs +... 


+ Dyadp-a ¥; 


B? = (b,b;)? 2 (bb; + b, bi.) si (by; ss b, Diss + babi+2) +... 


so (bb; i b bis: + bebis+ ee 


+ Dyin ay’, 


—1 = (body). 


Wenn die Verteilung der # (und dies ist der fiir uns interessanteste Fall) 


‘normal ” ist, so kénnen obige Formeln 


betrachtlich vereinfacht werden. Daman 


in diesem Fall # (&) gleich 3 [H (&)} oder 30,‘ setzen kann, so haben wir : 


S (a, — H (a)P 


20,! ; 
he +— : das Fehlerquadrat = (vergl. Biometrika, Ul. p. 276). 
’ 
(a; — M,) Qo! 
; n—-1 n—1° 
S A’w.\2 
. (A 2; ) (3n es 4) ox! -s = 
> Bech " t. (n=) oder angeniihert 
n-2 
¥ (Ax. . 
oe ay (35n — 88) o,! 4o,! 
6 (n— 1) 9 (n— 2 sj i n—2° 
= 0s 
=(s wi) es dog 
20 (n — 2) 50 (n — 3)? : i 


n—k 
= (A® ax; 
Fiir ages endlich kann man 
(n —k) 
1 Ke! 


stellen : 


das Fehlerquadrat in solcher Form dar- 


o5.4 > \2 ; ki(k-1V) \2 
— ig. k)+2(n—k—-1) . + 2(n-—k—- 2)( \ ) 


(n—ky | =a 


+2(n —k-—8) 


k. 


+2(n—k—k) 


/ 
\ 
+2(n-—k—j)) (i 


(k ae (k+2) et 
( 


-(k—1).(k—2). 
k+1).(k+2).(k+3)...(e+)) 


(k +1) 


(k +1). (k + 2) 
.(k—1). (k— 2) 








Ena 


(k—1).(k—2)...2.1 y{ 
(kb +2).(k +3)... +h) )° 
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Es ist also klar, daB zusammen mit dem endlichen Differenzieren die Unsicher- 
heit der Bestimmung von a,’ stetig wichst, anfangs etwa im Verhiltnis 
ID 2f3 2/4: f/5.... 
7. Das Fehlerquadrat des mittleren Produktes. 


Ei: 








E (Eee) (EE? 














Fiir a ergiebt sich das Fehlerquadrat - 
> (a; — Mz) (yi — M,) 
Fir + <9 ergiebt sich das Fehlerquadrat 
E (Eiri) — LE Ei)? , LE Edi) P + ontoy? 
n n(n —1) , 
n—1 
z A'a;A'y; 
cok om — ernie H as Rehlerauadrs 
Fii 7% ergiebt sich das Fehlerquadrat 
(2n — 3) {E (Eee) — LE (Ev) P) + (n— 1) {LE (Ewa + 20,3 
2(n—1/ : 
n-k 
> A®a,A®y; 
Im allgemeinen Fall ‘Sag erhalten wir fiir das Fehlerquadrat folgenden 
EE») 
Ausdruck : 
1 
ee 2 —9R\IK 2qfp.2) — ny wh; 2) 
Temp 48 (2) (E GVA) — LE EF 


+2 {[F (Es) + o20,7} [A (mn — 24 4+1)4+ AZ (n— 2k 4+2)4... 4 AP (n—2k+k)] 
+ 2B? {EB (Ep?) —[E (Epi) Ph + 4 (LE (Epi) P + ot} (BP + BE +... + Br..)} 
wo die Koeffizienten A,’, A,’,... B,*, BY, ... dieselbe Bedeutung haben, wie in § 6. 
Wenn a; und y; einander vollkommen gleich sind, so ist 
[FE (En) P={L(@)P=oc; E(ERW?)=H(E*); oo =r", 
und obiger Ausdruck fallt mit dem in § 6 zusammen. 

Fiir den Fall der “normalen” Verteilung kénnen wir auch alle diese Ausdriicke 
betrachtlich vereinfachen, besonders wenn wir F (£;;) und H(&?y?) als Funktionen 
von 7,, darstellen. Olne aber hier darauf einzugehen, wollen wir jetzt iiber den 
Korrelationskoeffizienten ins Klare kommen. 

8. Definition des Korrelationskoeffizienten. 

Der Korrelationskoeffizient R wird gewéhnlich nach der Formel 

n 
> (a; - M,) (Yi = M,) 
R,= L —_- berechnet. 


af 3 (a; — Mz? (yi—M,)* 
1 1 








te Rh 











FELT 
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Zu welchem Ausdruck ist er nun als empirische Anniherung aufzufassen, 


. (x; sa M,) (Yi = My) - E E (a; _ M,) (Yi * m,)| 
1 








zu i > aeneemmeeee —— oder 22: —— SF 
J Sea —M,) > S E (% — M,y E (> («;- M,)).E [2 (yi- My] 
1 1 


Beide Formeln sind durchaus nicht mit einander zu identifizieren und fallen 
nur in erster Annaiherung zusammen. Da die zweite aber bedeutend leichter zu 
handhaben ist und dies auch mehr den iiblichen Rechnungsmethoden der englischen 
Schule entspricht, so definieren wir rz, als 


ae (a; — M,) (4 sl seers 











: ) 
Anders ausgedriickt ist rzy = Pay , WO Pry, Gz, Fy die Bedeutungen haben, 
o  Saty : 


welche wir ihnen oben in §§ 4 und 5 beigemessen haben. 


9. Das Verhalten des Korrelationskoeffizienten zweier oscillierender Rethen 
x2 und y, wenn man deren GrédBen durch Differenzen ersetzt. 


Fiir die k-te endliche Differenz von 2 und y haben wir 








ye a Qe! 
Ei | > Aa, A, 7) ae < y) 
\4 yi _ Pata, Aty ki. gi? 
Alix, AMy ~ 7 .  « oo = 
—k = = " No- (hia a (e, 2k! 2k! 
E "S sae; ve _B > A 2 | Al Ay A = 
J BI 4 . Vi | k! k! x k! ki x 
: eS . ae 
Tax, A®)y — Oz Fy = Vgy- 


Wir haben also ganz allgemein das genaue Resultat : 
g 8 
Try = Ta'z, A’y = Ay, Ay = Ty's, A”’y =... = T Awe, A®y: 


Da aber diese r unbekannt bleiben und wir fiir ein beliebiges T Ae Ay Bur 
>) 


n—t 
> AM A%y; 
dessen Annaherungsformel oe ——— —-— kennen, so miissen wir 


u~-t ni 


> A%z? ° > SAN ye 


wiederum feststellen, inwiefern man sich in der Praxis auf die Ubereinstimmung der 
empirischen Koeffizienten mit deren mathematischen Erwartungen verlassen kann, 
wie groB also die Unsicherheit ihrer Bestimmung zu schatzen ist. 

Biometrika x 36 
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10. Mittleres Fehlerquadrat des Korrelationskoefiizienten der endl. Differenzen 
zweier Rethen. 








n-k 
S Ax. oA My 
Aus der Formel R,= — , kann man folgenden Ausdruck 
—- 
4/3 S A®z;. ‘S 2 Zamye 
ableiten : 
n—k n-k n-k 
> SAM We: sA% y; = Ag? 3 Ay? 
1 e 1 
— Drak k ———_ - ong ———— —o° x 
R; — Try n- poe eer. Pat da, Ay n Aa n Ay 
ay PaWrcA®y 20° AW) x 207 Awy 


der nur in erster Annaherung und, wenn alle 4 Briiche des Ausdrucks echte sind, 
richtig ist; dies ist bei groBem n der Fall. Und ferner ergiebt sich daraus die 
Formel : 


1-7, (fk - k.(k—-1 : 
oa,= Genie (0-H +2-E- 0 (P55) +20-b-9) (Gree) 
_-3)(:@-)--2) ges 
ters )(@ri) e+) m) + 
k.(k-1).(k-2).. ‘ 
2(n—k—k) | — 
3-8 =") 551.50. eD. a1 
"S “(A%a,) 
(vergl. dazu die Formel fiir —= in § 6) 
8 okt 
kei") 





Die Formel fiir o*, ist immer nur dann giltig, wenn man 


Pa {BEE Yi) —[E Epi)? , 2 (e) —[E (EYP, By) — [2 (y*)p 








n LE (Epi)? 4[H(E)] a[HP)P 
_ Ee) — EE) E)_ E (Eye) — E (Ei) EY) 
EE). B® EE). EC) 
E (Ey?) — E(&). el 
2E (B). BW) 
gleich ql —_ setzen darf (vergl. Biometrika, Vol. 1x. p. 4). 

Aus der Formel fiir oR erhalten wir: 
‘ Be. (1 i tay 
= Ry em n j 


ea (l—r yy? 3n—4 
Ri n—1l = *'2(n—1)’ 
_ (1-1)? 35n— 88 
60h n—2 °18(n—2)’ 
_ (1 — ry? 231n — 843 
Y wee °360 (n -$) : 








4 
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Die Fehlerquadrate der Korrelationskoeffizienten aufeinanderfolgender Diffe- 
renzenordnungen verhalten sich folglich zueinander ungefahr wie 2:3:4:5.... 


Die Unsicherheit wichst also mit zunehmender Differenzenordnung etwa im 
Verhiiltnis 
V2:43 1/4: 75,.... 


11. Korrelationskoeffizient zweier zusammengesetzter Reihen, die aus oscillato- 
rischen und evolutorischen Elementen bestehen. 


Da “Student” diese Frage treffend dargelegt hat, konnen wir uns kurz fassen. 
Wenn wir in Betracht ziehen, da8 fiir uns die evolutorische Komponente einer 
Reihe schon dann in der Praxis verschwunden ist, wenn sie im Verhiltnis zur 
oscillatorischen Komponente so klein geworden ist, daB sie nur die 3“", 4", u.s.w. 
Zahlenstellen des Ausdruckes fiir R beeinflussen kann,so kommen wir zum SchluB, 
da8 nicht nur Komponenten, die durch eine Parabel héherer Ordnung darstellbat 
sind, sondern auch solche, denen nur transzendentale Gleichungen (z. B. Sinus- 
reihen) geniigen, beim endlichen Differenzieren eliminiert werden, Ja mehr noch, 
man kann beweisen, da8 iiberhaupt alle mehr oder minder “ glatten Reihen,” alle 
bei denen eine geniigende positive Korrelation zwischen den Nachbargliedern 
bemerkbar ist, fiir die Praxis beim endlichen Differenzieren verschwinden. Das 
verallgemeinerte Cave-Hookersche Verfahren ist daher augenscheinlich ein sehr 
universales Mittel, die Korrelation oscillatorischer Elemente aus zusammengesetzten 
Reihen herauszuschilen. Es hat aber einen Haken, auf den hier noch hingewiesen 
werden mu. 


12. Kann man aus dem Verhalten der Reihe R,, R,, R., ... Ry bestimmen, ob wir 
den Korrelationskoeffizienten rein oscillatorischer Reihen vor uns haben? “Student” 
scheint zu glauben, daB wenn irgendein R; seinem Vorginger R;_, gleich ist, wir 
es sicher mit dem Korrelationskoeffizienten oscillierender Elemente zu tun haben. 
Vor einem solchen Schlu8 ist nachdriicklich zu warnen. Wie es meine (fiir diesen 
Artikel etwas zu langwierigen) Berechnungen zeigen, kénnen zwei Nachbarkoeffi- 
zienten R;, R;, auch bei stark evolutorischen Reihen einander ungeféhr gleich 
sein, und die Wahrscheinlichkeit eines solchen Zusammentreffens ist gar nicht sehr 
gering einzuschitzen. Nur wenn wir, von irgendeinem &; angefangen, immer 
dieselbe GriBe fiir R erhalten, also Rj = Rj.= Rjz.= Rjx;=, wird ein solcher 
SchluB berechtigt sein, und je linger die Reihe gleicher R, desto wahrscheinlicher 
wird dieser SchluB. 













STATISTICAL NOTES ON THE INFLUENCE 
OF EDUCATION IN EGYPT. 


By M. HOSNY, M.A., B.Sc. 





The statistical returns for Egypt are—as compared with European data—still 
in a somewhat elementary stage. Age-distributions are of very little value, and 
in the case of infantile mortality we have only information for certain towns. 
Further, in the larger towns there is a considerable cosmopolitan element, which 
gives them a widely different character from the often sparsely populated rural and 
desert districts. Education is not compulsory, and schools and literacy are largely 
confined to Cairo, Alexandria and the Canal Government, even when we exclude 
all foreign scholars. In the same way criminality* preponderates, in an inverse 
order it is true, in these three districts, but it is not absolutely certain whether this 
is due to their more efficient policing, to the presence of more foreigners, or to a 
real absence of crime in the rural populations, Crime does not appear to arise in 
Egypt from poverty or drunkenness, two of the main factors of its origin in 
Western Europe. The criminal, indeed, is rarely habitual; he is an amateur, 
rather than a professional, and criminals are more often well-to-do, their crimes 
arising from motives of revenge or passion. 


The fact that criminality in Egypt is highly correlated with literacy and 
scholarship would be noteworthy and might possibly be used as an argument 
against education, did not the association of crime and education arise from the 
prevalency of both in the more populated districts, where again we find the 
greatest abundance of foreigners. Naturally such questions arise as: 


(i) Are the foreigners—and if so, which section of them—to any extent 
responsible for the prevalence of crime in the districts frequented by them ? 


(ii) If we allow for urban conditions, will there still be found a high asso- 
ciation of crime and education ? 


It is perfectly easy to obtain from the Egyptian Census—we used that of 
1907—the number of foreigners of each denomination in the various Egyptian 


* We understand by “ criminality ” in this paper, not commission of but conviction for crime. 
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governments. The only difficulty here was the presence of British troops in 
Cairo and Alexandria, which placed that nationality in an anomalous position. 
These were estimated approximztely and subtracted. The following groups of 
foreigners were then dealt with: (a) Ottomans, (b) British subjects, French, 
Austrians, Germans and Russians*, (c) Greeks, (d) Italians. The Greeks and 
Italians were separated from the general European group (b), because they 
are largely differentiated, the Greeks being frequently small traders and the 
Italians often manual workers. Their large numbers also justified a separate 
classification. 


Table I gives the foreigners per 10,000 in the 17 Egyptian districts we 
were able to deal with. It will be noted that the Greeks far outnumber other 


TABLE I. 


Foreigners per 10,000 and Population per sq. kilometre. 


Europeans other Population | 
Governments | Ottomans | than Greeks and Greeks Italians per sq. 
| Italians kilometre 

Cairo aia a ee 153 312 298 204 6060 
Alexandria 7 coe | Ba 514 745 482 6780 
Canalt ... cau ae 116 583 846 145 7666 
3eherat ... bea ae 13 23 3 11 178 
Charkieh ms sie 29 5 24 l 257 

| Dakahlieh and Damietta | 18 5 18 3 346 
Gharbieh om eS) 2 0) g 0 226 
Kalliuhieh Sa ae 8 3 13 2 4169 
Menufieh ~ ee 2 l 7 0 618 
Assiutt ... bee ne 1 z 3 l 454 
Assuan ... aoe ae 8 5 13 4 533 
Jeni Suef at as 15 6 9 l 351 | 
Fayoum ... 10 3 4 0 255 
Gerga ... pips ove 2 0 2 0 532 

| Guizeh ... ew ees 6 6 4 3 447 
Kenat ... oe oe 5 4 4 2 339 
Miniat ... ars “ote 10 5 7 1 458 


foreigners, but that all foreigners are concentrated in the Cairo, Alexandria and 
Canal governments. 


It was far more difficult to obtain a measure of urban conditions. We had 
to take very rough measures of the density of the population, because the limits 
of certain areas are too vaguely defined to be of any service. El Arish has been 
excluded from the Canal district, Suez and Sinai have also been excluded as there 
is no enumeration of them with respect to criminality, literacy and scholarship. 
These densities, with such value as they have, are given in the last column of 


Table I. 


* The contributions from other smaller nationalities were omitted. 
+ Various approximations and omissions occur in these cases in obtaining density. 
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Table II provides the number of male criminals per 1000 of the male popu- 
lation, the literacy or number of male persons able to read and write per 100 of 


TABLE II. Educational and Criminal Indices. 





Governments 


Cairo 

Alexandria a 

Canal and El] Arish 
Behera ... avs 
Charkieh ies ba 
Dakahlieh and Damietta 
Gharbieh ca 
Kalliuhieh 

Menufieh 

Assiut 

Assuan 

3eni Suef 

Fayoum ... 

Gerga 

Guizeh 

Kena 

Minia 


yp eg per 100 males 


Male 





12°90 28°03 
14°15 30°09 
22°30 23°39 
5°30 9°29 
4°20 9°09 
4°35 8°18 
5°65 8°22 
6°65 8°13 
3°85 8°45 
5°45 7°01 
4°20 7°68 
5°70 8°42 
6°85 6°54 
3°95 5°84 
10 6°38 
3°45 5°34 
9°20 7°25 


Literacy 


| 
30°20 
19°99 
8°54 
1°01 
1°66 
1°76 


Male Scholars @ 
5—19, per 100 : 
| boys of those ages 


TRAC Ie iat ioe 


the male population*, and the number of male scholars aged 5 to 19 per 100 of 
the native boys of those ages t. 


We shall use the following 


Tables I and II: 


O = Ottomans, 


G = Greeks, 


symbols to denote the factors which occur in 


I = Italians, 


E = Europeans other than Greeks and Italians. 
C=Criminality, JZ = Literacy, 


Each government was treated as of equal weight, although the populations 
vary from 233,000 in Assuan to 1,485,000 in Gharbieh. 


S = Scholarship, 


and product-moments were found without grouping. 
obtained : 


Me 
my 
Mg 


Mp 


Yorrelations : 


Tpe 


Means Standard Deviations 
=7015, oo = 4791, 

= 11-019, o, = 7641, 

= 5031, oa, = 1°735, 

= 1528, op = 24755, 
=+°9614 + 0124, rps = + 8097 +0566, 


* Egyptian Census, 1907, p. 99. 


+ Foreign male scholars are excluded in the case of Cairo, Alexandria and the Canal. They have 


no sensible numerical existence elsewhere. 


tique de VEgypte, 1912, pp. 95 and 135. 


Criminals and scholars are taken from the Annuaire Statis- 


a 

D = Density of Population. 
The standard-deviations 

The following results were a 

Fi 


Correlations 
Tor = +°8450 
Pog = + 6242 
ris = +°9028 


Tpr = +9563 + 0138. 
Now at first sight these results would seem to indicate a very bad influence of 


education on crime. Where literacy and scholarship are greatest, there criminals 


+ 0468, 
0999, 


+ 
+ 0303, 












Sp aN oe eS eatin ae! 





ese ee 








M. Hosny 283 


are most numerous! And a superficial argument might be used to condemn the 
character of education in Egypt, or education in general. But it will be clear 
on examination of the isolated values that the observed high correlations arise 
solely from the urban character in Egypt of both criminality and education. We 
have endeavoured therefore to correct this by finding the partial correlations for 
constant density of population. 
There now result 
ples = — 9554 + 0143, 
pTor = — 9231 + 0242, 
pTis = +°7480 + 0721. 
Thus, while there still remains a quite considerable relation between the prevalence 
of literacy and scholars for constant density, we find that for a constant degree of 
urban conditions, the greater the literacy and the greater the amount of education 
the less will be the criminality. The negative correlations are now even higher 
than the uncorrected positive ones and of course are markedly significant. While 
admitting the slender nature of the Egyptian data, we think that this swinging 
over of the relation of crime and education when we correct for density is sug- 
gestive, and it would be of interest to work out similar correlations for states 
in which the statistics are of a more ample ‘character. It does, however, appear 
reasonable to assert that there is no evidence to indicate that education leads 
to criminality—rather the reverse—in Egypt. 
We will next consider the influence of the presence of foreigners in Egypt. 
We find: 


Means Standard Deviations Correlations 
Mo= 99°53, o>) = 195°60, Too = +°8425 + 0473, 
Mp= 86°88, on= 183°70, reo = +°9546 + 0145, 
Me = 119°41, oqg= 256°61, Too = +°9429 +0181, 
m,= 68°24, o,;= 152°04, rro= +9192 + 0254. 
Correlations: 
no = +9575 + 0136, ron = +9844 + 0050, 
rng = +9491 + 0162, rpp= +9617 +0123. 


Here, if we judged by the raw correlations only, we must assert that the corre- 
lations of crime with the presence of foreigners are so high, that the foreigners 
must be corrupting the Egyptian population. But again the association only 
arises because the criminals and foreigners are bo‘! prevalent in the big towns. 
If we correct for density of population, we find the results are very different. Thus 
we have: 

poo = — "9811 + ‘0061, pn = +1692 +°1591, 

pac = +3524 + 1433, prc = — 0713 +1628. 


It is now obvious that the correlation of Europeans other than Greeks and 
Italians with criminality has become insignificant having regard to its probable 
error; the correlation of the presence of Italians and criminality is now negative, 
but less than its probable error. Thus of Christians only the presence of the 
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Greeks may possibly, but not certainly, be detrimental. The Ottomans have 
now a large negative correlation of a quite significant character, or we might 
assert that the presence of Ottomans tends to diminish criminality. The Greeks 
are frequently moneylenders and alcohol dealers, and the Ottomans, especially 
the Arabs, have among them a good many religious teachers. 


We have, however, to note that criminality is greatest in the Canal Govern- 
ment, where Europeans and Greeks are most frequent, while the Ottomans are 
most numerous in Alexandria, where crime is almost 40% less than in the Canal 
Government. ‘To test the influence of the three densely populated governments, 
we put the Canal proportion of the Ottomans at Cairo, that of Cairo at Alexandria 
and that of Alexandria at the Canal. There resulted: 

Toc = +°9707, instead of + °8425, 
’'po = + ‘9870, instead of + 9575, 

leading to proc = + ‘4918, 

or we may safeiy say, that if the proportions of Ottomans at Alexandria and along 
the Canal were interchanged, then no relation between the presence of Ottomans 
and the absence of criminality would exist, indeed the relation would probably be 
reversed. The prevalence of the Ottomans in Alexandria has been attributed to its 
more temperate climate. There is certainly a large Ottoman element in Alexandria, 
there being 21,827 Ottomans out of a population of 332,246, and it is larger 
than any other foreign element except the Greeks. In Cairo, with 29,516 out 
of 654,476 inhabitants, the Ottomans exceed any other single foreign element. 
It is conceivable, therefore, that they may be able to influence the moral tone of 
those towns. It must be borne in mind, however, that crime is far more frequent 
in the Cairo and Alexandria governments than in the more purely rural districts, 
and we can scarcely suppose that Cairo and Alexandria would reach the still higher 
criminality level of the Canal, were it not for the presence of the Ottomans. In 
the Canal Government there are exceptional conditions, and we can hardly assume 
that a transfer of the Ottomans from Alexandria to the Canal would interchange 
their proportions of criminality. Greeks no doubt flock to the Canal for business 
purposes, the other Europeans largely for control purposes; the Ottomans, 
relatively speaking, avoid it. Without further analysis it would not be possible 
to assert definitely that the presence of Ottomans reduces crime. It may be 
doubted whether the presence of foreigners, with the possible exception of the 
Greeks, is really associated with the extent of criminality in Egypt. 

A further investigation was undertaken in regard to the possible influence 
of education on infantile mortality. The birthrate and deathrate in Egypt are 
both remarkably high. Thus for the years 1899-1909 inclusive the average 


rates were: 
Births per 1000* Deaths per 1000 Excess of Births over Deaths 


Cairo 40°7 35°7 +50 
Alexandria 380 31°8 + 62 


* §till births not included. 
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Many European towns with half the above birthrates have considerably greater 
excesses of births over deaths. 


Unfortunately the infantile mortality is only recorded in Egyptian towns and 
not in the governments at large or in the rural districts. We are obliged, there- 
fore, to deal with these only when considering the relation of education to infantile 
mortality. From p. 49 of the Annuaire Statistique de l Egypte, 1912, we obtain 
the infantile mortality for 1911, and note at once how extraordinarily high it 
stands. From p. 286 of the Census of Egypt, 1907, we take the percentage of 
male literates in the total male population, and from the Statistiyue Scolaire, 
1912-1913, p. 74, the percentage of scholars in the total population*. As it 
was possible that the density of town population might influence the results, we 
took the number of persons per house, which was about the only social factor 
available. This will probably represent fairly closely the average size of family. 
This was taken from the Census, 1907, p. 286. The mean, 564 persons per 
house, suggests that the average number of living children can hardly exceed 
three. The marked relationship that occurs in European towns between gross size 
of family and infantile mortality cannot be satisfactorily tested on the Egyptian 
data, because we cannot ascertain the infantile mortality in each size of family. 
The number of persons to the house is indeed rather a measure of net than 
gross family, and we only know this as an average value for each town. It does 
not follow that a town with a low number of persons per house is one with 
small gross families; the low number may be due to the heavy infantile mortality 
itself. Accordingly the correlation between persons per house and infantile 
mortality is not necessarily even a measure of the influence of overcrowding on 
infantile mortality (although this is often supposed to be the case); it is con- 
ceivable that a high infantile mortality might be the source of a low number of 
persons per house, and the unravelling of cause and effect is only possible 
where we know not only the number of persons per house, but its relation to 
both the gross and net family of that house. 

Let J = Infantile mortality, Z = Literacy, S= Scholars, P = Persons per house. 
Then we have the following results : 


Means Standard Deviations Correlations 
M, = 29°3 ) cr = 7608, TOL = "1040 + ‘1500, 
M,, = 21-96, o; = 5137, rps = +°5309 +1278, 


M,= 5°569, o, = 2°951, Trg = +°0093 + °1508, 
Mp = 5°643, op = ] ‘428, 
Correlations : 


pp = + 1675 +°1487, rp, =— "3421 $°1487, rpg = +0296 + “1508. 

* The scholars were taken for 1910-1911, the year of infantile mortality, but this involved 
the assumption that the foreign scholars were the same in numbers in 1910-11 and 1912-13, probably 
not a very inaccurate assumption, which in any case affects little more than Cairo, Alexandria, Suez 
and Ismailia practically. It is the number of Egyptian scholars that is rapidly changing and the 
scholars dealt with in our ratio are Egyptian only. 
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TABLE III. 


Infantile Mortality. Persons per House and Education. 








| 

Infantile Male Literacy ‘cholars | Persons | 

Town Mortality per 100 of per 106 of per 

per 100 births population population | House 

Saive Dae) eee eS ye eT eats Tastee 

| | 

Cairo... ave 32°9 28°03 | 6°12 4°62 | 

| Alexandria ase 26°9 | 30°09 3°41 8°43 

| Damietta a) 18°1 | 708 | 1:89 696 

Port Said ae 21°0 | 24°13 2°64 4°14 
Ismailia ... a 16°0 28°05 | 1°22 4°10 

glee Sa 26°9 } 274 | 064 | 385 | 

OR, ges nse 29°6 4°87 ae 
Zagazig ... ... 27°9 6°47 | 6:07 
Tantah ... aoe 29°6 9°45 5°18 
Mansorah age 21°4 | 6°20 4°60 
Chibine El Kom 16°1 5°10 4°92 
Damanhur sks 27°5 | 3°55 | 7°27 
Guizeh ... a 35°6 11°31 6°12 
Fayoum ... ‘ae 40°1 4°78 9°34 

Beni Suef weal 37°1 7°19 511 

Minia oo8 orl 38°2 8°89 4°96 
CT ee 33°6 11°48 6:00 
Sohag... — 29°0 5°58 6°15 

Kena... ee 37°4 4°80 5°14 | 
5°78 4°09 





Assuan ... ie | 41°1 


It will be clear from these results that there is no significant relation between the 
literacy of the male population and infantile mortality. There is also no significant 
relation between the number of persons to a house and the number of scholars, 
ie. it does not appear to be the more crowded towns which have the largest 
percentage of scholars to the population ; Alexand:ia and Damietta, for example, 
have considerably more than the mean number of persons to the house and 
relatively few scholars. On the other hand a larger number of literates marks 
less crowding. Crowding and infantile mortality are slightly related, but con- 
sidering the probable error, not with definite significance *. 


While literacy has no relation to the infantile deathrate, it is noteworthy 
that there is a significant correlation (+°5309 +1278) between the number of 
scholars and the infantile deathrate, which is greater where there is more education. 
Now this either suggests that many scholars mean large families and large 
families correspond to increased infantile mortality, which is usual, or that the 
towns in which there are the classes who educate their children have a higher 
infantile deathrate. The only means, and those inadequate, of testing the first 


* This agrees with the result for overcrowding and infantile mortality in English manufacturing 
towns, where the correlation is very small and sometimes has one sign and sometimes the other. 
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assumption are to take the partial coefficient between scholars and deathrate for 
constant number of persons per house. 


We find*: pry = +5336 4 °1107; 
similarly pry, = — ‘0504 + 1543. 


There is thus a slight increase in the relation of scholars to deathrate when 
we take a constant number of persons per house, and it is hard to believe that 
the relation is indirectly due to size of family. The second result shows that 
literacy has no relation to the infantile deathrate. Towns like Alexandria, 
: Damietta, Port Said, Ismailia, and to a less extent Suez, with a low infantile 
deathrate have a low education rate, and towns like Cairo, Quizeh, Beni Suef, 
Minia, and Assiut, with high infantile deathrates have high education rates. The 
first towns are on the sea or the canal, the second in the Nile Valley; it is con- 
ceivable that the latter are the more unhealthy for the infant; it would need 
special local knowledge to explain why education has been most accepted above 
Cairot. There doves not, however, seem any relation between ignorance, as 
measured by literacy, and a heavy infantile mortality, nor on the other hand can we 
assert that education and European influence have certainly increased criminality. 





* The value of pr., is+ °0207 + -1508, and is therefore not significant. 

+ It is noteworthy that there is no relation between literacy and number of scholars, i.e. education 
of children does not appear to follow the power to read and write in their parents, that is to say 
if we judge by the averages in towns and not by individuals. 











HEIGHT AND WEIGHT OF SCHOOL CHILDREN 
IN GLASGOW 


By ETHEL M. ELDERTON, Galton Fellow, University of London. 


In 1905-6 an enquiry was made in the Public Schools of the School Board for 
Glasgow as to the height and weight of all the scholars, the occupation of the 
parents, the number of rooms occupied etc. By permission of Sir John Struthers, 
of the Scottish Education Office, these schedules were most kindly placed at the 
disposal of the Galton Laboratory. 

The number of children concerning whom the enquiry was made is over 
seventy thousand of ages 5 to 18 years. The schools from which these children 
came were divided into four groups according to the district in which the schools 


were situated. 


Group A comprised schools in the poorest districts of the city. 


ae * e in poor districts of the city. 
mi + % in districts of a better class. 
Ee » in districts of a still higher class with which are 


included four out of five Higher Grade Schools. 


The data were originally used by the Galton Laboratory with the object of 
discovering how far the physique of school children, judged by their height and 
weight, is affected by the occupation of the father and the employment of the 
mother. With this end in view the necessary data were entered on cards. 
Children over 14 were excluded and all children who had not both parents alive 
were also excluded; this left us with 30,965 girls and 32,811 boys. 

The object of the present paper is to ascertain what is the average weight of 
a child of a given age and a given height. 


The first step in this enquiry was to sort the cards and form tables giving the 
distribution of weight for each height at each age in each school group, and this 
laborious work was carried out very largely by Miss Augusta Jones; this step 
necessitated making 72 tables, and she is responsible for 58 of them while the 
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remaining 14 are due to Miss H. Gertrude Jones; I have to thank most heartily 
these colleagues for their very efficient help in this matter. 


The three factors with which wé are concerned are the age, height, and weight 
of school children. The instructions issued to the teachers in the schools for 
recording these three facts were that ages were to be given to the nearest year, 
weights to the nearest pound, and heights to the nearest quarter of an inch*. The 
method of recording ages is very important. Ages being recorded to the nearest 
year, this means that children classed as 6 years were from 5°5 years to 6°5 years; 
and the average age of this group was 6 years; this is not the method most 
frequently employed for recording ages; “age last birthday” is generally used 
and if “age last birthday” be given as 6 years then the children of that age are 
from 6 to 7 and the average age of children in this group is approximately 6°5 years. 
It will be seen at once that a comparison of weights and heights of two groups of 
children of 6 years cannot be undertaken until we know which method of recording 
ages has been adopted. The height and weight of these Glasgow children have been 
compared by Dr Leslie Mackenzie and Captain Foster+ with the height and weight 
of children as given by the Anthropometric Committee of the British Association 
and it is pointed out that at each age the average weight of the children is 
uniformly below the “standard of the Anthropometric Committee,” and that 
generally speaking the same thing applies to height. As a matter of fact this 
point as to age has not been noticed by these writers and children whose average 
age is 6 years in Glasgow are compared with children whose average age is 
65 years, naturally the younger children are shorter and lighter. There is further 
Y an important question to be asked: Which standard of the B.A. Anthropometric 
Committee ought to be selected? To this point I return below. 





As I have said the Glasgow children’s ages were recorded to the nearest yeart, 
but the Anthropometric Committee recorded age last birthday, and before these 
children can be compared the six months extra growth must be allowed for. This 
is quite easily done by finding the regression of height and weight on age and 
adding half the regression coefficient to the height and weight of the Glasgow 
children. We have found the regression for children of 5 to 14 inclusive to be as 


follows : 
Boys Girls 
Regression of Weight on Age rhe 4564 4916 
si Height on Age ae 1-807 1-937 


* It is not known what record was made when an exact half year, an exact half pound or an exact 
quarter inch occurred. 

+ Report on the Physical Condition of Children attending the Public Schools of the School Board for 
Glasgow, by Dr W. Leslie Mackenzie and Captain A. Foster. Wyman and Sons, 1907. 

+ The actual wording of the Glasgow direction to school teachers runs: “In recording age, disregard 
months and record to nearest year; thus 6 years 7 months record as 7 years, 8 years 3 months 
record as 8 years.” It is not clear how 6 years 6 months would be recorded; we have assumed as no half 
years are entered in the schedules that an exact calculation was made in the case of each child of 
doubtful age to ascertain whether it was or was not past the half year. 
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This means that we must add 2°28 Ibs. to the weight of the Glasgow boys and 
‘90 inches to their height and 2°5 lbs. to the weight of Glasgow girls and ‘97 inches 
to their height before we can compare them with the Anthropometric Committee’s 
standard. The Glasgow children still fall below the “ Anthropometric Committee’s 
Average” but not to the appalling extent shown in the diagram at the end of 
Dr Mackenzie’s and Captain Foster’s Report. Personally I should hesitate to 
compare actual height and weight of Glasgow school children with the so-called 
Anthropometric Committee’s standard. The so-called standard is taken from the 
Final Report of the Anthropometric Committee of the British Association, 1883. 
In Tables XVI—XIX, the average heights and weights at different ages of males 
and females of different classes of the population of Great Britain are given. For 
example in the case of stature we have four classes: Class I, Professional Classes, 
Town and Country, 10,739 individuals, ages 9 to 60; Class II, Commercial Classes, 
Towns, 5472 individuals, ages 8 to 60 (5 below 8 are of no service for means); 
Class III, Labouring Classes, Country, 8727 individuals, ages 3 to 70 (8 below 3 
are of no service); Class IV, Artizans, Towns, 126,236 individuals, ages 3 to 60, and 
451 babies at birth. All these data are pooled and the column headed “ General 
Population, All Classes, Town and Country,’ and it is this “General Population 
which is so frequently cited by various medical authorities, including Dr Leslie 


” 


Mackenzie and Captain Foster, as the Anthropometric Committee's “standard.” 
What they understand by such a “standard” it is impossible to say. It does not 
represent the “General Population” of Great Britain, but the total population 
measured by the Committee. In this all the babies are artizan babies, there 
are only 8 children from 0 to 2 and these belong to the labouriag rural classes, and 
there is no professional class contribution until after 9 years of age. Then the 
various age groups are made up from various social classes in proportions which 
bear no relation whatever to their actual proportions in the kingdom at large. For 
example, the average height of lads of 18 is determined from 1724 of the pro- 
fessional, 62 of the commercial, 148 of the rural labourer, and 371 of the town 
artizan classes! It will be quite clear that a “standard” reached in this way 
means absolutely nothing at all, and yet this is the “standard” which, attached to 
numerous weighing machines is posted in innumerable public places up and down 
this country. It does not in the least represent any “General Population” of 
Great Britain. To be a standard of the general population each class should have 
been properly weighted, and this cannot be done as in certain classes certain ages 
are quite inadequately represented, or not represented at all. There is in fact no 
such thing as an “Anthropometric Committee's standard” for either height or 
weight. The only thing that is possible is to compare the corresponding social 
class in that Committee’s measurements with the measurements under considera- 
tion. In the case of Dr Leslie Mackenzie’s and Captain Foster’s data, this is 
undoubtedly the Class IV, “ Artizans, Towns.” Such a comparison is made in the 
accompanying diagrams. It will be seen that the Glasgow children as far as 
height is concerned are the equals if not the superiors of the Anthropometric 
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Committee’s artizan class. In weight they appear to be somewhat less, but here 
Dr Leslie Mackenzie and Captain Foster have overlooked the fact that the Glasgow 
children were weighed without boots; but the British Association Committee weighed 
in ordinary indoor clothing, i.e. with boots or shoes on. Now girls’ boots weigh 
as much as 11 to 21 Ibs. and boys’ bodts 13 to 34 lbs.* Hence in comparing 
children in Glasgow with those six months older, Dr Leslie Mackenzie and Captain 
Foster have dropped 21 lbs. in weight, while in comparing children without boots with 
those with boots they have dropped another 1} lbs. to possibly 34 lbs. We should 
anticipate therefore that their readings would be 3} to nearly 6 Ibs. too small t. 
There is in our mind very little doubt that the weight of the Glasgow children is 
at every age equal or superior to the weight of the artizan children measured 
by the British Association Anthropometric Committee and the statement of 
Dr Leslie Mackenzie and Captain Foster that “at each age from 5 to 18 the average 
weight of the [Glasgow] children is uniformly below the standard of the Anthro- 
pometrical Committee}” arises from their having entirely overlooked the con- 
ditions as to class, age and manner of weighing which were adopted by that 
Committee, a knowledge of which was essential to any comparison with the Com- 
mittee’s data. In the diagrams on pp. 292-3 we have given the Glasgow measure- 
ments set against those of the artizan class of the Anthropometric Committee, 
and the reader will see clearly how all the arguments based on differences between 
the Glasgow and the “ Anthropometric standard” fall at once to the ground. 
There is nothing exceptional in the Glasgow data, they differ of course from data 
for the children of the professional classes, but this difference is not confined to 
Glasgow. Apart from this point it is essential that the ages of the two groups of 
children should be the same and not differ by six months. 

In the data used for this paper, children of 5 were omitted; they are few in 
number and are not therefore likely to give such reliable results when each age 
group is used separately. The mean weight for each height in inches was then 
found and the regression equation calculated. These equations are given in 
Table I. It will be observed from these equations that, though some irregularities 
occur, generally speaking weight increases more rapidly for a given height in the 
better school groups, at the later ages, and for girls more than boys except at 
ages 6 and 7. 

We can see from these equations that the multiple regression surface for 
weight on height and age is not absolutely planar. It can be shown that it is 

* New ‘‘tacket ” boots for girls of five in Glasgow weight 1 lb. 5 oz. falling to about 1 lb. 3 oz. 
when the tackets are worn down ; for girls of fourteen 2 lbs. 6 oz. falling to about 2 lbs. 2 0z. For boys 
of five years new tacket boots weigh 1 lb. 14 oz. falling to about 1 1b. 11 oz. when worn down ; for boys 
of fourteen the former weigh 3 lbs. 9 oz. and the latter about 3 lbs. 30z. We have to thank Dr Chalmers, 
M.O.H. for Glasgow, for this information. 

+ Many public elementary school children have great masses of metal on their boots. Undoubtedly 
the older children have heavier boots, and we can see from the diagrams that the divergence of the 
Glasgow children from the Anthropometric Committee’s artizan children increases with age. 

t Report, Scottish Education Department, 1907, p. iv. 
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Height and 


TABLE I. 











Weight of School Children in Glasgow 














Glasgow. 
Age Group A, Boys Group B, Boys Group C, Boys Group D, Boys 
| 6 | W= — 21°164+1°503H | W = —22°5764+1:5532H | W=- 24:489+1591H | W=— 25°678+1°603H 
7 | W=-21-:0654+1:°519H | W=—26°308+1°635H | W=-- 27°390+1667H | W=— 34°819+1°818H | 
8 | W=-19°38141:495H | W=-28°127+1°695H | W= 6°640+1:227H | W=— 32°7774+1°790H 
9| W=-24°65241635H | W=—32°8264+1:818H | W=— 20°6714+1562H | W=— 36°03041°883H 
10 | W= —29°589+1:768H | W=—35°6964+1:899H | W=— 42°9424+2°055H | W=—- 51°636+4+2°218H 
11 | W=—54:910+2:302H | W=—39°7254+2:005H | W=—- 43°628+2-088H | W=— 67°664+42°546H 
12| W=—-49°5134+2°217H | W= —62°890+2°476H | W=— 55°38642°337H | W=— 62°052+2°450H 
13 | W=—65-467+2°547H | W=—63°51642°511H | W=— 81°3424+2°854H | W=—- 99°908+3°160H 
14 | W= —83°784+2°888H | W=—76°749+2°775H | W= —103°66143°251H | W= —126°4464+3°633H 
| Age | Group 4, Girls Group B, Girls | Group C, Girls Group D, Girls 
dee | = = 
| | 
6 | W=-14:56141:329H | W=—15-985+1:345H | W=— 23°7284+1:551H | W=—- 27°563+1°624H 
7 | W=-19°762+1465H W=-24'1304+1556H | W=— 29°312+1693H | W=— 30°226+1°694H 
8 | W=-20°72141:503H | W= —30°6224+1:718H | W=— 29°11341691H | W=— 40°156+1°927H 
9 | W=-30°133+1°730H | W=—29°0814+1-°709H | W=— 38°6244+1917H | W=— 45:066+2°045H 
10 | W=—36-478+1°878H | W=—38°866+1:925H | W=— 46:2634+2-088H | W=- 62°015+2397H 
11 | W=-48°707+2°153H | W=-—43-005+2°:034H | W=—- 51°146+2209H | W=- 57°7544+2°339H 
12 |. W= —58:27742°360H | W=—63°908+2:465H | W=— 77°31642°735H | W=— 84:298+2°859H 
13 | W= -74°15642694H W=-—88-043+2-939H | W=- 83:960+2°892H | W= —103°59443:229H 
W= -95°4644+3°084H W=-—84:4964+2:906H | W=—106°1334+3:317H | W= —134:197+3°804H 





| 
! 


14 | 








W is weight in lbs., H is height in inches. 


The ages are central ages, and to obtain the weight corresponding 
the child should be taken to the nearest whole year. 


weight on age and not height on age which is non-linear. 
linearity is not great, but Mr H. E. Soper, in order to smooth the material, fitted a 


to a given height 


The departure from 


parabolic surface to the regression surface of weight on height and age. 


Let W be as before the weight in lbs, H = height in inches, and y equal the 


age of the child measured from 10*. 


W 


Then 
pea dg, (y) + bo (y) H 


is the form of the surface when the relation of W to H for a given age is sensibly 


linear. 


Mr Soper now assumed : 


$: (y) = a + HyYt+ay’, d.(y) = by + by + bay? 
and determined a), a, and az, b, 6, and b,, so that: 


= 
> 


{n (p(y) — dy — Hy + dey’)! = minimum, 
{n (bd. (y) — by, — b,y — bay”)} = minimum, 


where n is the number of individuals in any age group. 


* Thus y takes every value from — 4 to +4 and we have nine equations to deal with. 
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of the model gives the mean weight for the several 
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Heights of Girls of a given 


Model of Regression Surface, giving mean weight of Girls of Class B of Glasgow Schools 


Age. 


Plate XIX 


QW REPORT , I906 


for a 


given 


Height and Age. The mean weight is the vertical coordinate and each section parallel to the front 
See p- 
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Now ¢,(y) and ¢,(y) for given y’s are the values determined in Table I for 
the + snstants at each age of the regression lines 


W=-A+B8H, 


and n is the number of children dealt with at each age. Our type equations are 


then of the form: 


Ay + 4a,% (ny) + 4a,% (ny*) = 42 (A), 


1a,> (ny) + 4a,> (ny) + 4a,% (ny*) =42 (Ay), 


ta,> 


and similar equations for },, b,, b,, with B for A. 


= (ny”) + La, (ny) + La, (ny') = 42 (Ay’), 


When these constants had been determined we put Y=10+y and obtain the 


equations given in Table II. 


TABLE II. 


Glasgow. 


W= Weight in lbs. 


H=Height in inches. 


School Children. 


"= True age. 


Boys 

Group 4 W= {02181 Y?42:214 —-2554V} x H- {1:13275Y2+ 67542-14°7417F}, 
a W= {01533 Y?+1:900-"1493 Y} x H— { 83314¥2+ 53:101-— 9°8662Y}, 
ot Sat W= {03990 Y?+3-614— ‘5796 Y} x H - {2°08397 Y?+ 139-456 — 316570 F}, 
st W= {02983 Y2+42-799 —-3636Y} x H— {1:76624 ¥Y2+111:407 —24-0174 7}. 
Girls 

Group A W= {01880 Y2+1°657 —"1644Y} x H—- { 96454Y2+38-119— 9°7305Y}, 
= B W= {02081 ¥Y2+1-907 —-2043 Y} x H— {1:16330 ¥2 + 60-230 — 13-6925 VY}, 
a C W= {02315 Y24+ 2-222 — 2457 Y} x H— {1:21642 Y* + 67-626 — 14°3416 F}, 


D W= {02701 Y?2+2°385 - 


'2832V} x H- 


{156165 Y* + 87-624 — 18-9239 Y}. 


A model of the surface for Glasgow Girls of Class B has been made by 
Mr Soper. Allowing for the points based on few observations at the end of each 
regression line of weight on height for constant age, the scroll represented by black 


threads is quite a good fit to the observations represented by card sections, The 


model is, however, difficult to photograph in a manner which shows effectively the 


approximation of the thread scroll to the cut card sections. The reader should 
note that an additional thread is placed between each of the threads which 


graduate the regression lines for the different ages. 


Further eight tables (Tables a—@) have been constructed in order that the 
average weight of any boy or girl of a given height and age can be read off at once. 
See pp. 300—303. It has been stated before that the age groups in Glasgow are 
from 5°5 to 6°5 years etc. and that 6,7, 8 etc. are the centres of each age group, 
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i ca :; 
| Regression on Age | 1°914 ins. | 1°859 ins. | 1°908 ins. 
| 


Glasgow. 
Mean Height Mean Weight 
| ee age Nea Bes RU ee CR a Cae 

A B c D A | B C | D 

Bea a ee a ae 

Boys: 5'5—65]| 41:3 42°1 42°] 43°0 | 409 420 | 42°5 | 43°3 

6-5— 75 43°0 44:0 44-0 44°8 44-2 456 | 45:9 46°6 

T56—85| 45°1 45°9 46°2 46°9 48:0 | 496 50°1 51°2 

— 95 | 47:0 | 47°7 481 | 49°0 52:3 | 53°9 54°4 56°3 

9°5—10°5 48°38 | 49°5 499 | 50°9 56°7 58°4 59°5 61-2 

10°5—11°5 506 =| 5171 515 52°6 616 | 62°7 63:9 | 66:3 

1175—125 |) 5283 | 528. | 68% 54°2 66°4 | 67:8 69°71 | 70°8 

12-5—13°5 53°8 | 54°3 55:0 55°9 71°7 29 =| We |: 69 

185—14'5 | 552 | 555 | 572 | 57°7 75°6 77°3 82-2 | 83-2 
pam oe As REE: CERES eA Sh tet. 
Regression on Age | 1-800 ins, | 1°728 ins. | 1°847 ins. | 1°846 ins. | 4°305 lbs. | 4°395 lbs, | 4°772 lbs. | 4°914 lbs. | 

Girls: 55-65] 410 | 420 | 41°9 | 42°7 39°9 40°6 41°3 41°8 

65—75| 42°9 43°7 | 43°7 44°8 43-0 43°9 44°7 | 45°6 

T5— 85) 446 45°6 45°6 46°4 16°4 47°7 48:1 | 49°3 

85— 95 46°6 474 | 476 48°6 50° 51°8 52°7 | = 54°3 

9:5—10°5 48°5 49-2 49°4 | 50°4 547 | 55°8 56-9 | 58°8 

10°5—11°5 | 50°3 511 | «(512 | 52:2 59° =| «= 60°8 61-9 64:4 

11°5—12'5 | 52°4 53-0 53°3 |. 541 653 | = 66'8 68-4 70°5 

12°5—13°5 54:4 55'2 55°4 56°5 72°4 | ° 743 76°1 78°8 

13°5—14'5 | 55°8 57°1 57°0 58°7 76°8 81:°3 | 83-0 89°0 
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but since frequently the centres are at 6°5, 7°5 etc. we have constructed Table IIT 
which enables anyone to find mean height and weight at any age between 5°5 and 
145 years. The regression lines are calculated from the original tables in which 
children of 45 to 5°5 years were included. The regression lines omitting the 


TABLE III. 
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1-943 ins, | 4°51 Ibs. | 5-083 Ibs. | 4-944 Ibs. 
| 


| ‘ Picyt ara 





children of 4°5 to 5°5 years were worked out for height on age and weight on age 
for boys in Group A, and were found to be 1°81 instead of 1°80 for height and 
439 instead of 4°31 for weight, but such differences are not great enough to 
matter and the remaining regression coefficients were not calculated with children 
of five years excluded. 


In connection with the tables (1 to 72, pp. 304—339) it should be noted that in 
transferring the data for boys from the original sheets to cards, ‘75 of an inch was 
included in the inch above; for example, 30°75 inches was entered as 31 inches 
and the centre of the group of 30 inches is 30°125 inches. The data for the girls 





5°489 lbs. 
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were transferred to cards much later and the simpler method was employed, and 
30°75 was included in the 30 inch group and the centre of this group is 30°375. 


Through the kindness of Dr Priestley, School Medical Officer for Staffordshire, 
we have been able to obtain the regression of Weight on Height for certain 
age groups of boys and girls in that county. Staffordshire is a county of very 
various occupations and contains an agricultural as well as a mining and factory 
population. 


The children measured are “entrants” and “leavers” and a further group of 
children, namely those from 8 to 9 were measured. The “leavers” include 
children of 12 to 14 years, “since in general the only ‘leavers’ at age 12 to 13 
are rural, and the only ‘leavers’ at age 13 to 14 are urban*.” 


The children were of the age stated, 5 and not yet 6, 8 and not yet 9, on 
January 1, 1911, but the actual day of weighing may have been any school day from 
January to December, so that a child entered as 8 may have been only a few days 
short of 10 when it was actually measured, and therefore the mean age of the 
group of children of 8 to 9 will be 9 years. “In the case of the group of leavers, 13 
to 14, no child can have been more than 14, because on attaining that age the 
children are entitled to leave school, and generally do leave. With these the mean 
height and weight in our tables refer to the true mean of the years of the group, 
viz., 13 and a halft.” We shall table to the middle of the group, namely at ages 6, 
9, 13 and 134. 


The children were weighed and measured without shoes, but in ordinary indoor 
clothes. The figures were read to the nearest quarter of an inch and to the nearest 
quarter of a pound. 


Staffordshire Children. 


| } 


} GIRLS Boys | 
Ages | aie sea ae ok | 
tecressi a Recressi g 
Mean Mean a * Mean Mean a 
leig Jeig 5: ig leig ody 

Height Weight Height Height Weight Height 
6 41°9 39°8 1°705 42°] 41°O 1*741 | 
9 A7°7 51°1 2°024 18°] 53:0 2°120 
13 16°7 78-0 3-272 55°8 75°3 2811s 
134 71 81°0 3°360 56°3 YA sex’ f 3°166 


It will be as well to compare these means with those for all Glasgow; so far we 
have not given them in this paper for ali the schools taken together but only for 
each school group. ; 


* Staffordshire County Council, Annual Report of the School Medical Officer for the Year 1911. 
J. and C, Mort, Ltd., 39, Greengate Street, Stafford, 1912. 
t Ibid. p. 25. 
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Glasgow Children. 


GIRLS 


Mean Mean 
Height Weight Height 





41°9 
47°7 
54°6 








Girls in Staffordshire are taller at ages 6, 9, and 13 than girls in Glasgow, but 
they are lighter at ages 6 and 9. We might argue from this a lack of physique in 
Staffordshire girls who are absolutely ‘7 lbs. lighter at age 6 than Glasgow children, 
and relatively to their height even more than this amount. At age 9 the absolute 
difference is less and at age 13 Staffordshire girls are heavier than Glasgow girls 
but they are 1} inches taller, and since the regression of weight on height at 
age 13 for girls is 3°272 lbs. we should expect Staffordshire girls to be 4°9 lbs. 
heavier than Glasgow girls, but they are not so much. I should hesitate to say 
that the physique of Staffordshire girls is inferior to that of Glasgow girls; the 
difference probably is one of race, but such questions must remain unsolved till we 
have a far wider range of anthropometric data than is available at present for all 
the districts of Great Britain. Boys show the same characteristics to a lesser 
extent; Staffordshire boys are taller at ages 6, 9, and 13, but they are lighter in 
weight ; at age 6 they are ‘8 lbs. lightez than Glasgow boys; at age 9 they are ‘7 lbs. 
lighter and at age 13 they are 1° ibs. heavier. Again relative to their height 
Staffordshire boys are lighter than Glasgow boys at the three ages for which a 
comparison can be made, 


Comparing boys and girls in Staffordshire we find that girls of 6 and 9 are 
shorter and lighter than boys of the same age, but at 13 and 13% girls are both 
taller and heavier. At 6 and 9 years the regression of weight on height is 
practically the same for both sexes, but at 13 and 13} the regression of weight on 
height is greater for girls than for boys; girls are heavier proportionally to their 
height than boys are. For girls of 13 an additional inch in height should mean 
3°3 lbs. more weight while for boys the additional pounds expected are only 2'8, 
while for girls of 13} we expect 3°4 lbs. increase for every inch of growth and for 
boys 3'2 lbs. increase. A comparison of the regression coefficients with those given 
for Glasgow in Table I will show that the coefficient is higher in Staffordshire for 
children of 6 and boys of 9 than in any of the school groups in Glasgow. The 
regression coefficient found for girls of 9 and 13 in Staffordshire is practically 
identical with that found in Group D in Glasgow, and boys of 13 in Staffordshire 
would seem to be most like boys of Group C in Glasgow. 
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In a Drapers’ Company Research Memoir * recently published tables are given 
showing the height and weight of boys and girls of 12 to 13 years who were 
members of the Worcestershire, public elementary schools. These tables are 
XLIV and LIX and will be found on pp. 100 and 107 of the work cited; we 
have calculated the mean heights and weights and the regression coefficient of 
weight on height as we have done for Glasgow and Staffordshire. The mean age 
of the group of children of 12 to 13 years is 12°5 years, so allowance must be 
made for the six months age difference in comparing with the Glasgow data. We 
have already given the mean heights and weights of Glasgow and Staffordshire 
boys and girls of age 13 so we will calculate what the height and weight of 
Worcestershire children at age 13 would be. An additional year makes a difference 
of roughly 1°9 inches and 49 lbs. in the height and weight of a girl and of 
1'8 inches and 4°6 lbs. in the height and weight of a boy. 


GIRLS 





Pomorren 
| of Weight | 
on Height 


| Regression | 

Mean ates ta | 
of Weight 

| on Height | 


| 
Mean | Mean 
Height 4 Weight 


Mean | 
Height he Weight 
| 
| 
} 


5) 


} 
Worcestershire | 


| 
; Sta uffordshire 
( tlasgow 


| 
| 
| 2°800 


Worcestershire children are taller than Glasgow children but slightly shorter 
than Staffordshire children. They are also rather heavier than Glasgow children 
but not relatively to their height. The height of Worcestershire children of 
125 years is the same as the height of Glasgow children of 13 years, but the 
weight of girls is 2 lbs. less and of boys is 1} lbs. less. Worcestershire children 
are lighter than Staffordshire children, but when allowance is made for the 
difference in height the Worcestershire children are not much at a disadvantage ; 
girls are a pound lighter and the weight of boys is practically the same. 


The differences we have found between the Worcestershire, Staffordshire and 
Glasgow children may well be due to differences of local race, and not be the 
results of differential environment or nurture. We should have little hesitation in 
applying the returns for Glasgow children as an approximate standard—say to the 
Ib. and inch—for all British children of the artizan classes. 


* «© A Statistical Study of Oral Temperatures in School Children with special reference to Parental 
Environment and Class Differences,” by M. H. Williams, Julia Bell and Karl Pearson. Studies in 
National Deterioration, TX. 1914. Dulau and Co., Ltd., 37, Soho Square, W. 

+ This weight appears somewhat exaggerated. It may in part be due to local differences in the 
average ages of ‘leavers.’ 
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TABLE a GLASGOW. BOYS. GROUP 
Weights for Height at each Age. 
Actual Age. 


-*. 




































Be Pee ee et Be > | 11 12 13 | I 
33 28°5 — _— —_— — aa — a 
3h 31-0 ieee ~ — — — _ — 
35 32°5 _ —_ _ _ = — ioe 
36 33° 34°3 — — _— — — — 
37 35°4 35°8 — —- — — _ —_ 
38 36°9 37°4 37°3 - - _— -- — 
39 38°4 39°0 39°0 38°4 — _ _— 
40 39°9 40°5 40°6 40°2 39°3 —_ —_ —_ 
41 40°2 41°4 42°1 42°3 42°] 41°3 — — — 
42 41°7 42°9 43°7 44°0 43°9 43°4 42°4 — — 
43 43°2 44°4 45°2 45°7 45°7 15 °4 44°7 43°6 42°1 
4A 44°6 45°9 46°8 47°4 47°6 17 °5 47°0 416 °2 45°0 
= | 45 46°1 47°4 48°4 49°] 49-4 49°5 49°3 48°7 47°9 
=; 46 47°6 48°9 49°9 50°7 91 °3 51°5 51°6 51°3 50°8 
“oO 47 49°] 5O*4 §1°5 52°4 53°1 53°6 53°8 53°9 53°7 
= 48 50°5 1°9 53°] 54°1 54°9 55°6 56°1 56°5 56°6 
49 53°4 4°6 55°8 56°8 57 °7 58°4 59°1 59°6 
50 54°9 6°2 57°5 58°6 59°7 60°7 61°6 62°5 
51 57°8 59°1 60°5 61°8 63°0 64°2 65°4 
52 59°3 60°8 62°3 63°8 65°3 66°8 68°3 
53 7 60°9 62°5 64°2 65°8 67°6 69°4 71°2 
54 64°2 66°0 67°9 69°9 72°0 74:1 
55 = 65°9 67°8 69°9 72°2 74°5 77°0 
56 - 67°6 69°7 72°0 74°5 fy he 79°9 
57 69°2 71°5 74°0 76°7 79°7 82°9 
8 - — 73°4 76°1 79°0 82°3 85°8 
59 78°1 81°3 84°9 88°7 
60 83°6 87°4 91°6 
61 — 90°0 94°5 
62 — | - -- _— 97°4 
TABLE 8. GLASGOW. BOYS. GROUP B. 
Weights for Height at each Age. 
Actual Age. 
6 ? 8 9 a ae 12 13 14 
Sn ors |— | eS alas 
33 27°5 — — re sae as a — — — 
Bh 29°0 = = = Bes AS = —_ 
35 30°6 31°4 —<e = — = as 
36 32°1 33°0 — =a — so 
37 33°7 34°6 34°9 —- — — — 
38 35°2 36°2 36°6 _— — — — — 
39 36°8 37°'8 38°3 — _ _ a — — 
40 38°3 39°4 40°O 40°] 39°8 —-- —_ — 
41 39°9 41-0 41°7 41°9 11°8 — = —- — 
42 £1°5 12°6 43°3 43°7 13°7 43°3 —_ _— — 
43 43°0 44°2 45°0 45°5 45°7 45°5 — _ _— 
4d 14°6 15°8 46°7 47°3 47°6 17°6 = — =a 
P 45 46°] 17°4 48°4 49°] 49°5 49°7 49°5 —- - 
a 16 477 19-0 50°1 50°9 515 51°8 51°8 516 — 
car | se 49°2 90°6 51°8 52°7 53:4 53°9 54:2 54°2 54-0 
one 48 50°8 92°2 53°5 54°5 55°4 56°0 56°5 56°7 16°8 
ie 49 92°4 53°8 55 °2 56°3 57°3 58°1 58°8 59°3 59°6 
50 53°9 95 °4 56°8 581 59°2 60°2 61°1 61°8 62°4 
51 7 0 58°5 59°9 61°2 62°4 63°4 64°4 65°3 
52 58°7 60°2 61°7 63°1 64°5 65°7 66°9 68°1 
53 61°9 63°5 65°1 66°6 68°1 69°5 70°9 
54 — 65°3 67°0 68°7 70°4 72°0 73°7 
55 — 67°1 68°9 70°8 72°7 74°6 76°5 
56 = 709 72°9 75°0 77°1 79:3 | 
57 72°8 75°0 77°3 79°7 82°1 | 
58 - 771 79°6 2°2 85°0 | 
59 79°3 81°9 84°8 87°8 | 
60 — 84°3 87°3 90°6 | 
61 Pte 89°9 | 93:4 | 
62 - — - 92°4 | 962 | 
63 —-|- _ _ — -- | — | 950 | 99°0 | 
* Throughout weights are given in lbs,, heights in inches. 











E. M. Evprerton 









TABLE y. GLASGOW. BOYS. GROUP 0. 
Weights for Height at each Age. 
Actual Age. 






























40 38°4 40°5 41°7 41°8 eg ea — — — 
41 40-0 42-0 43°2 43°5 a — — _ _ 
| 42 415 43°5 44°7 45°] 44°7 — — — — 
| 43 13-1 | 45-0 46°3 46°7 16°5 — — — 
| bs 44°7 46°6 47°8 48°4 18°3 A7°5 = = o 
| 45 46°2 48°] 19°3 50°O 0°1 19°6 48°5 = 
| 46 17°8 49°6 50°8 51°6 51-9 51°7 50°9 — — 
4 19-4 51] 524 53-2 537 53°7 53°3 _— — 
E 48 51°O 52°6 53°9 p4°9 55°5 55°8 55°7 D5°*4 —- 
=| 49 - | 54-1 554 56°5 57°3 57°9 58°2 582 oo 
be} 50 — | 556 | 570 | 581 5971 99 | 606 | 610 — 
2] 51 7 | 58°5 59°8 60-9 62-0 63-0 63°8 64°6 
| 5 aay 600 | 61-4 | 62-7 | 64:1 65-4 | 667 | 67:9 
| 53 Fi | S 61°6 63:0 64:5 66°1 67°8 69°5 71-2 | 
54 — a 63°1 64°7 66°4 68°2 70°2 72°3 74°6 | 
| 55 ae Le = 66°3 68°2 70°3 72°6 75°1 ve ae 
| 56 | ¥ 700 | 72:3 | 75-0 | 77-9 | 81-2 | 
| 57 = = : = = 74:4 77°4 80°8 84:5 | 
| 58 yess some — 76°5 79°8 83°6 87°8 | 
59 ‘a = ~ _ 82°2 86°4 | 9172 | 
60 = Seas cs : 84°6 89°2 | 94°5 | 
61 : = — a4 | S73 I 
| 6 : 94-9 | 101-1 | 
63 =s — 97°7 | 104-4 | 
| 64 — =< ai 107°8 | 
65 = * lll | 
66 = 114°4 | 
67 Lies. 3 
TABLE 6 GLASGOW. BOYS. GROUP D. 
Weights for Height at each Age. 
Actual Age. (eee aS = ¥ 
6 7 8 9 10 11 12 13 14 
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GLASGOW. GIRLS. GROUP C. 
Weight for Height at each Age. 
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6 7 | ae tee 10 11 12 13 1h 
ea SI | a 
soo | — | — | — = 5 = ee eg 
ee ee eee o— | — — — — |; — 
33°1 | ee eee = = —< on ae hee 
34°7 35°3 | 35°3 ae — ~~ — — 
36°3. | 37:0 | 37:0 = = oe ame = — 
37°9 38°6 | 38°8 383 | — — — — | — 
39-4 | 402 | 405 | 402 | — a i a oS 
410 | 419 | 422 | 42:1 | 4155 oe $i wr | = 
426 43°5 44°0 44-0 43°6 42°7 ae — — 
44:2 45:1 45°7 45°9 45°6 45*0 <3 a | a 
45°38 | 46°38 | 47-4 47°8 47°7 | 473 46°6 — — 
47°3 48°4 49-2 19°6 49°8 | 49°7 49°2 — | — 
48°9 | 50°] 50-9 | 515 | 51°99 | 520 | 518 —- | — | 
— | 51°7 52°7 3°4 54°0 54°3 54°4 54°3 | a= | 
— | 83°3 54°4 55°3 56°0 56°6 57°0 673 | — 
56°1 7 °2 58-1 58°9 59°6 602 | — 
bes 23 57°9 59°1 60°2 61°3 62°3 632 | 64:0 
= = 61°0 62°3 63°6 64°9 6671 | 67:3 
= Es = 62°8 64°4 65:9 67°5 69:0 | 706 
a = = = 66°4 68°2 70°1 72°0 | 74:0 
3 a - 68°5 70°5 72°7 74:9 | 77:3 
= _ = 70°6 72:9 75:3 77°9 80°6 
= mt 2 = ae 75:2 77°9 80°8 83°9 
— _ — — 77°5 80°5 83°7 | 873 
_ = . - 79°8 83°1 86°7 | 90°6 
ae = Sy = = 85°7 89°6 | 93:9 
a es 7 s a : = 92°6 | 97°2 
noe = ™ es = 95°5 | 100°5 
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= a — 2 : <2 a 107-2 
TABLE @ GLASGOW. GIRLS. GROUP D. 
Weight for Height at each Age. 
Actual Age. 
6 7 8 9 10 11 12 13 14 
29°4 — — . a = = = 
31°1 = = — — — — | 
32°7 34:0 <i : - — — — | 
34-4 | 35°7 a “i oa - a 
361 | 37:4 | 37°8 - _ — — — | 
37°7 39°1 39°6 — = — | _ — — | 
39°4 40°9 41°5 41:2 ae eee = “a 
41°0 42°6 43°83 43°3 a eee = 
42°7 44°3 152 45°3 14°7 — — — — 
44°4 16°0 17°O 17°3 46°9 — j—~— — _ 
46°0 47°8 48°9 19°3 49°2 48°4 —- - a 
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Weight of Boys of 55—6°5 years 

Mie td foal tb ee iid 

es ae ae Se SSeS SS eS Ss 
$1§—f 1.— 1 ——— — — — ee EE EE EE 2 
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Oe eae ey oe i ee fe og ea a 15 | 
| $5 4 See ae et ee ee ce er — el ee are — 21 
ea es — Baris fo €. 3 Se fe ee 52 | 
87.,, | — — 172 1615 5 1-1———11—-——— 73 | 
we ——— 11020 3 34 15 16 2 - = ee ee 134 | 
eS ee ee er ae ae ee a ee ee ee 155 | 
{Rhos (eee & 2) C628 Boe stb Se 216 | 
141, I——— l1— 1 6 @ 56 52 3116 7 1 1— —— — 194 | 
Mo Bas ce se ET SS 10 Oe ee OF 1 ot 167 
43 -————— — 5 7 2.229515 7 311———¥J] 108 
hf, t—-——— ee oe ta a ee ee Se — 50 
ye _ ——— — 1 — ] 2 4810 5 24 ——— — 37 
46 F—— — eae ee ee CU ——lUrd dO YH Ze 8 
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TABLE 33. Glasgow. Height and Weight of Boys of Group D. 
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TABLE 34. Glasgow. Height and Weight of Boys of Group D. 
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TABLE 35. Glasgow. Height and Weight of Boys of Group D. 
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TABLE 36. Glasgow. Height and Weight of Boys of Group D. 
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TABLE 41. Glasgow. Height and Weight of Girls of Group A. 














Weight of Girls of 9°5—10°5 years 
Height I I 4 I I | | I | | i] iI iI ,| s| ,| I i l EY J t | R Totals} 
2'SbesnaS S88 = 6 @e2 23 8 ef fee 

34 —F — — ei ai a ae a a ae SE Ee Ee Ee Oe 1 | 
a5 99 — —_ i— — — a —- l _ —_ - — _— —-— al 2 
36 ,, gy aye Ee ite a ee gee ee Mem Lene pa Seem 3 
37 ” —-—— l — — — —  —--= — —-_ — — ~ - — — 1 | 
38 ,, —_———— | — — | a 2 va. an nine 2 
ee rr. sed ee en we = wes iS heen) eee 4 
40 ,, ——21—1%1—-— 1 — - — — — — — — — — — — 6 
Yo ——— 1 68 4 8 a a aes cheats ecw oS reece Seabe poe Geen eam seer me al 23 
42 5, —— 113 510 7 7 11 38-— —]| —~—————-—-—--- 39 
yi ——l1l_— 3668 8 4 83— — base ee es ee ee ee SS 40 
44 55 —— 1 — 4 41815 22 12 6 4 5 1 1 — — — — — — — — 88 
45 55 ———_— 3 6 62 0 2 210 10 — —— — — Se —— — 132 
56 ,, ———_ — — 4 310 2 26 30 30 12 3 4— 1———————— 148 
He i= — ——— 1 8 23 26 40 38 26 16 13 5 2—— —— — — — 193 
48, {———_ — — — 1 8 11 9 2 380 4 32 2112 6 2 1—————— 193 
49 5, — ————— 1 5 1 18 84 @% 272411 2 31—!1——— 161 
50 ,, -———— — —— 1 — 6 6 13 2% 21131010 4 21 —— — — 112 
r) ee — —— — — — 1 |— — — 6 8 111110 8 421——— 1 58 
62, §——----— — l— 2— 118381675 6i1—-——— 2 35 
58 ,, —-— — SO -_—— Ll 2. to £86 2: 4 25 
54 5, me Oe OO 2 — 2 IL 2 i 
55 ,, as ae eget Rs ater ee ee Meee > ote i Se ge ee a a ee 3 
56 ,, — eee a ae l me ee ee 2 | 
i. {— = oO = —- —- — — 1 1’——— — — — + — 1 — 3 
58, J——— — — — — — Se a ey roe egg te age Se = 2 | 
59 ,, —_—— — — — - i. ae ne en ee eae ae at 1 | 
| 

| Totals} 1 — 5 6 22 36 45 74 131 107 147 135 153 112 100 74 48 30 2515 9 2 2 1 4 | 1284 


Glasgow. 


Height and Weight of Girls of Group A. 














Weight of Girls of 10°5—11°5 years 

se et a Be Gg ae en a Ce ee eee | Pe ae: 
| BSRSRRISS ESE SsSISISESRRSRERSRSSSERSE 

gn : 
| 36... |g Ee Sa eae Peay Oe EN NTS Se ead VE hh Oy ip ee = 
| sr” — — — — Cli I atl Se apy ge OE ES SS. Seat aie, ew seta; iene l 
We NS aS aes, “CES pas ee Bem ag She ee ai 
ES I Oe ee a een eg 5 
| 40 ” ee ae Le. ae l1—— — = = — — — ——<—_— = — — << — a SS == ] 
a oe l oS. SS ee ee ae ee pe aes Se, Oe EP ge See Se ee og aes 8 
| 72” 23132t1i—2?2— 1— - cen a A Spee te ng ~ Oe 12 
\48,§—- 1656512 38— 1 — — —e—— —— — — — — — EE 18 
lAhy, 1— 24276 & Ll SD Ym Lamm ee ee ee ee Ee Ee 30 
| 45, i nee, ae ce a i a a a a as Ss ee ae ee a a es 55 
ree ee ee Be 8 as eerie Se a a ee oe ee 
Pe Ses ee 8 sc oe: a en er or i ere ee a 
| 48, | —— 1— 4131926 56 29 15 7 9——— 1 TSS Ste Se ee 
149, J—— 2— 2 61416 24 34 23 1816 8— 3 3— ——————— 1 ——— |] 100 
150, i—— — —— — 4 8 19 8B OW M16 6 610 $— 1—-—— — — 1 —— e 
61, }|—————— 1 4 5 11 2 222718 8 8 6 2 3—2 1 —-——-———— 144 
62, I——— l 1— 1 1 — 8 4 81216 615 6 461—1—-————-—-—— 86 
wl LS Se eae ieee tt — 1 232.40 967948311—11—-—-—-=— 59 
545, | —— ————-—-—-— —— 3 2352555 4 &g———— —— — — 36 
oy f—— — — i a Pan Bo 2414 1 SS = 12 
66, |———------— —_ —- —— — — —-— 1 11ir——— 1 — — — — 1 6 
58, ——-—-— -— ao a ES sade Seah ie eee ceca, eg, eal res. eee eae ate RRS Serine estrella 1 
59 J——— — — ee ng eg ee re ag preg ene l 
Totals] 4 10 22 24 52 69 92 96 143 127 106 108 98 63 29 51 341822 5 3 2 2 3121 1— 21 41187 

















| Heig 


4 
4 
h 
4h 
ji 
4 
4 


DS Gite Ce © 


2 










| Height 





TABLE 


Glasgow. Height and Weight of Girls of Group A. 


E. M. ELpEerton 


Weight of Girls of 11°5—12°5 years 





























Height 


20 41 70 121 


TABLE 


44. 





, 
, 
) 


Glasgow. Height and Weight of Girls 


Weight of Girls of 12°5—13°5 years 








Totals 











Biometrika x 





terweereye es Lid 

FSS Db 8 & : : Sssssus8 X | 
41 §— L— 1—- — — — — —~———- —~——-————-—] 3 | 
42 ,, Lh lmr —- —- Fe ee He ee 3 | 
8, \-----— a em: cree. Sam, geen ean, cy Se: ee, eee ae es ne ec Yn 
4h 55 hl @—— —- —- eS eS He 4 | 
4D ,, — 1 l1il— ee aa ee ie 
46, T—— 152 1— — —.—————— 9 | 
BF ics 23 6 9’°4..4. 2. — eS SS OD 
(25. i<6.320° 3-3. —-=— eee es ——————-—]| 8 | 
9, I —.2 31210 62 6 2 —~ — I1— 1————————— 63 
a a — 2 5 9 26: 28. ak SS — 1 —— — — — — | 7 
St. — 312 06.9% 15 6 8—11~-———————-=—Ff 91 
52 -— 349: 22. 90: 382.22. «8. 2 7 = a ee SS ee 
5S ,, -= —- 4.11.96 29 92 13 2 3—— — — — — — — — — J 110 
D4 5, -—— 4 6 18 27 299 148 4 1—~ ————————Jlll 
55 4 a 3 12 14.18 2911 9 4 3 2 ——— — — — —}f 105 
56 ., — 1— —_ 3 8 18 18131310 11 ——————— 86 
57, |———-— i == 2 OCIS 4S ie 4 Ss SS ee 55 
58 ,, - — -— —.1 1 58368311-2.2—1——f] 34 
of a -—— — 2 2123 6—11=—.-2 ——-—]|] 17 
60 ,, — — ee Yatoe -_— == —_— — 122422———-1—_—--—_— 14 
61, J——— —- —- — 1—-—112——21— —_—— 8 | 
62, J——-——-— —- — — — — — 3——— 1 — — — — — 4 | 
63, J——--— — ee ee 1———— 1 2 | 
64,, J——-——— ——— 1 ee ee eS SS 1 
Go, — —$ i — i i — a ee ee ee ee — 1 
66 ,, — — — — lh el lh ee ee 1 
67, I—-— — — — ee he EE 1 
Totals{ 5 9 20 45 79 136 126 139 53 54 4412 9 3 6 5 1 1 — 1 *| 976 


5 J J hi J " J J J J Hl J J J J J a J >) — 
SSN ON 
| 
4yj—|—— 1—-— — — — — — ~~--—-——-———— ~~~ 
%, |—-——-— 1 —- — — — ———-— — — — 1 
8, §—1—-—-— — — — — — —-—— + | - — 1 | 
a ee ee ee ren seees | one Se a a eee ee ey en aa ee eee a ae a — | 
40 5, — | P— — i a i ier eee errar aera arr err ee 12 
Pl 55 = SS 5 SL FS FE SF 8 SS 1 
ee 4s Ne eee 2 
143, 7— 1 22 Pe — ee FHM re EE 5 
as; —— 3 2 6 2— — - _- — eer ere err reer eer err ear errr arr eee ee 13 
5 —— 3 6 5 4 2—- — — ee SS SS ee eee 20 
46 4 — 1 3 616 2 1 2— —- —- -— — L— — — — — — — — — — 32 
ae — 1 31210 18 16 3 — oer err arr eres err ern nr arr rrr are ee 65 
48 5, ——— 610 23 19 11 11 2 Le 83 
49 5, ——— 312 29 2 2 WW 8 4 5 — — — — 139 
50 4, -— 23 6 20 2 43 35 17 304— — — — — — — — _—-—_— 162 
a ———— $ 13 2 37 0 18 15261 Ve — eH] ——| 1] —— — 161 
52, —— i — 8 5 21 37 25 2620 7 1 1——— — — — — — — — — 153 
53 ,, —_ — — — — l 412 21 3 321212 7 1-1 ———]— — — — ~— = = 133 
| 54 5 —— l—— 1 — Bild M4 BWM sd §6— 1.12 — HS — = SS 92 
155 ee a 2 7 131410 5&6 58 6&6 1 Be — — — — — — 65 
56 ,, Se et eee ee 1 l 4610 32323832 1°23 ———— — 41 
ST. . —_ —  — 3 i:tfd-3 2 23s 3—3 1 —— a] 27 
58 ,, — _ - - l1l— — 1—— 2Lim— t— ti ——— ] — — 9 
I Sete Berenice sn ee a a i cer le er 
60” es het ei RE Gy a A MS ERS Hi 1 
>) See -— = — HH 1S 2 28 
67 5 J—-——-—-—-_—- cpa cas hace ak apnea: ot cass ana ae iin pps all Pipe ek 1 
Totals] 1 6 129 159 186 125 115 77 64 32198 131013 3 5 31—I1— 1 1214 








326 


Height 


Height and Weight of School Children in Glasgow 


TABLE 45. 


| r w”~) 
< See ae 
> + &S 


Glasgow. Height and Weight of Girls of Group A. 


Weight of Girls of 13°5—14°5 years 


81-5— 








t Sr Sr Sr Sr Sa Sr 
™ oD Site Co TH & 


Xo 
Oo o& 


| 5 


D D 


| ro | ores or oo oo oe | | 


| — | 
lel | crt omeel | lt | 


LT LT TL | ee error | oe | 
| | ~mtom mcrcres te | | | | | | 


| eer re | | | | | 


| m1 come | 





Totals 


Height 





TABLE 46. 


bo 
@ 
2 
S 


and Weight of Girls of Gr 


Girls of 5°5—6°5 years 





| Totals 
b>) 
ne 
ma 
— 2 
ie 3 
— 2 
— fw 
13 
— 18 
oe 33 
“= 27 
— 37 
= 37 
— 32 
— 25 
- = 20 
3 22 
— 3 
fi 2 
2 291 


Totals 








«2 
~ 


S 


~ 


tt tb 4. > S . + Oo Co 


ws) 
D2 Ore Sy 0 


© OoN 


So SD Cis Ce © 


Sri Gr 
So | 


eS 


— bd = 
wre 


[1 Ll eee 





Totals 





nee Oe 
"gee ee FS 
ee ces 
2 l 
2 l 
— 2 t 
] 2 6 
3 9 
— l 9 
l 2 3 
— l 3 
_ — ] 
4 17 37 





= 




















TABLE 47, 





E. M. Evprrton 


Glasgow. Height and Weight 


of Girls of Group B. 




















Weight of Girls of 65—7‘5 years 

sn, ge He Me Oe Cee Same Se eee ees es oe ee Ae Os a ee ee ee 

Ten) 9 wD wD Ton) Ten) 9 wD wD wD Tony 9 wD wD oD Ton) DD wD wD 

~ & um © 5 & &® Y & HB & & AH FS KK BH B 

SS Ff.) 6 8 -& 2. = = = 2 2: SS |S SS eS 
32 i— l1—- —- Se ei a a ae OTe rea ree eee ee 1 
een Se Be Ae 5 
MBM, '—- 42 ~maee—e— tre in Kee e- K- K- S 9 
$5 5, —- 3—- 7FJ—- 1 — -—- — Se KF ere er re rae ee 11 
36 4, ape kG I ey pe ee ef 1] 
a — — 383 565 38 42 1—-—-—- -—- -—- - - rrr 18 
38 ,, ee RS a eee I ee ee ee ee eee 39 
39 ,, Rs Sey: ies | ay Rey ai See eee ee ep eter Seen eee. 77 
26. = — 2 <3 50-6898 So 8 2 Se SS SS 121 
a — I — 4 10 9 41 39 2112 5 83 2— — — — — — 166 
42 ,, — ees SA 7 Oe ae SW 8 RS aS ee 201 
Ve ea ee — 1 2 2 52 57 45 23-° 7 3— 1— — — — 213 | 
bh yy — Si = — 19 6a: Ser 20 a 8: Se: oS ee Se 210 | 
55 Se es oe eo ae oe Be SO) RE ee A a ee ere 129 | 
46 ,, a ieee VN ae: ee. ee ae ee Sie i Wee ae ie oe 76 | 
| ET ge a a ek ee fe ee he eh Soke a eS 38 | 
Ae ae eee ee a oe ee ee ei ee eee, ee 13 
SS ee ees ee ee 5 
TD ee ee ee ee ee 3 
Totals 1 14 9 48 71 194 157 224 215 170 133 83 54 21 17 5 1 38 | 1346 





Glasgo 


Weight of Girls of 75—8°5 years 


w. 


Height and Weight of Girls 





Totals 





| 1 L 
C2 ee Leo 2 * 22 ee ee toe 
38353—} — — 1 — — — — — oe ag, > See a a oe ee Se ae eet ee 1 
i ee a ee i ee ee 2 
Ie —— ll i-—-—- — —_ —— wae, de dee in) Neca seer ae joe Aaa 2 
6, I—-—_—— 1—-— — — a 1 
v7. f—1138311— — —| 1 S| — — HH -—-- 8 
8, §i——— 8 23121h— — ———--— ae 13 
399, I——_— 238116 741—- — ————-—--—---- 34 
40 5, lf —— 2 & 6M kt 2 = Bsc eee tte ees eo a 45 
jl, J ——— 1 51319 2 12 7 3 2 2—-——————— 87 
42, {|_—— 1 4% 3 622 33 30 16 9 2 1——————-—— 127 
8, 4——=— 3 6 8 39 31 36 30 15 5 2 2——— = — 177 
BB. t———— 1 28 8 24-351 31 49 28 11 6— 1°1— -ee —F] 
4s, 1——— 1— 1 8 6 2.48 3 4 416 7 1— 1———| 229 
me. §————-—-—1678 8B SB BH 9 2 1--—--—-—|j 
| a oe a ee ae a FC ae SS 32 Eee 2S 132 
48, J|——— — Ss oo SF 8D UA Oo 4 2 — — 69 
ee ee ee a ee ee ee ee ee a... 
| 50, J~——————-— — — — — '!| 1143 5—— 1 16 
1, ja — — — — — a eid e821 I 4 
62, J—————-——-— —- — —- —- -— eee 1—— 1 
Ss. 1 — — — — — — ee a l 





Totals 





51 78 155 162 170 198 


159 115 


30 19 




















328 


| Height 


Height and Weight of School Children in Glasgow 


TABLE 49. 





Weight 


of Girls of 85—9°5 years 


Glasgow. Height and Weight of Girls of Group B. 


Totals 


























SHSBSRSSESSR RHE VE VE RESRESSSBISSESRKRE 
ee ER re ie EE ga ag ae Pe a pee a a a gag ee 1 
}s3, J— 1 —— — — Pe ae ee ES SS Se SE EE 2 
| 34 5 ae Rl A = ay tic eh te el Se heal ea = aR GS ee Li 
Ss a ae a ea See ak. ee Eee a ee ee 2 
| 36 ,, —— | — — — — — on sels ape a ee rege y Rae re ome Ig bys a ee eS. 1 
37 » - o-oo err err emer or - —- —_—_— eel CU ee eee te ee ee ee ee ee es ee l 
$8, J——— 2 21 il— — SS Se SS Se SS y 
39, ~———— 23 4— 1 HK eS SS — ee 10 
On F—— — — 5 671 F—— 1K Hee = 22 
7 ee * ee LO Oe ee 37 
42 5 —— — 3.71419 16 8 — — — 1———— - — — — — — — 68 
i —— ee ee ee ee ee ee 64 
44 1 —— 1 91122 28 37 21 10 8 4—— 1—— ————— — 153 
bi So — — —_ 1 2-2 10. 85 46 SS (85. 180.10 5 2 SS SS a 
a ae 1 210 19 34 50 36 23 22 3 6 1— —— — —— — — 207 
BR SSS es ey ee a eee Oe a ae, ee eee ee ee 
8 y f——— — — — 2 2 S&H 9 6 243314 9 712 ———~— — 176 | 
9, J————— 1—— — 8 8 UN 14 87% 810 4311 1—~——Ff 115 
50 ,, J—— — —-——— — — — 28 8 141017 8 21 1————— 63 
eS Se SS ES FG A 8 8 2 SS 35 
Ee ee ee a es teem. ae es ee eee ee ee ee 12 | 
(iy | ee Ser ee aig ee ne Se ee aod 2 
OS ee ee Cr re eee, Cee ee 3 
CR Se ee gi temas Di Sem mar cere ae mci ae a cae "es EE l 
| ome 0 eee come eee eee cee ae oe aoe — — — — — ==» => a= a= a= au au auee aes au eee osu — 
| 9 ,, F— — — — — — — — — — — — — Pee ee 1 
| Totals] 2 1 1 3 15 42 74 92 123 163 161 142 156 132 98 58 4423 8 8 5 4 1 2 2 1360 
TABLE 50. Glasgow. Height and Weight of Girls of Group B. 
Weight of Girls of 9°5—10°5 years 
at CEPR OEE SEE EM CET ERE ENS OE gas 
SShHSSSIRBW@Q ex 6 BR_ SVssSsSsSesSRReereRSNS 
| 363—] — 1 — — —--—- —- Se  —— — — — 1 
87, Pl —-————— — — 1 — -_—— -— a La a, 2 
| Se a i— 2. ee ee es = SEO po agen ee Dip Bh 3 
389, ~————-— 21—— 1 —- — — — 1—— — — — — — — — 5 
HO, J—-——-— 21— — — — — — — 1-- _ _——— 4 
ee. Sis OE UP RE GE. eo es a = SE pn ee ean 10 
42 ees) OS oh ee he 2 eS = ee ee faa ae pe 2] 
4S 4 naa: a a a i eee ee ee ee 1] 
pie ee ae = SES IS IS eS GS eh a ee eS 62 
MD y, ti 2.317 CO 8 BR 8 OY 3S 22 Se — —— —] 117 
46, 1— -— 2217 2 2 93 20 18 8 42— 2 —- — — — — — 142 
ae l PE BRaR DS eee a a a 8S OS tre ee et —- 194 
1. rime SO Oi 4288 1 OG OL IS A ET PS SS SS 
49 ,, _- — en eee ORES a EE Ee A OL BO EY aOR: a eh oe es 
50, {—— -—=— —- «7 38.4 8 18 6 3091 10 6 4 2 1— 1 —— =F 196 
| On as — —_— — 1 2 l l $ fw M4 Th 6 fe — 2 ——' ] 110 
52 ,, - Se eat eer ee B.S SS Soe FS 8 Se eS 60 
53, | — alaieniantientitentindtitcntieta tae Te an ee cee ee ae et ns enn Ga | 24 
154, -—— — -—— — — — — TS IH 11e8@1WH1pH ~ 
65, |———---— ——- lmnrinr——+-— @—-—2—1— 1 8 
56, |——--—-----— —-- - lr eer eee 1l12i1—— 6 
a7 5 J—— ———-— —— — — — — 821 1 4 
| Totals} 2 3 4 11 24 35 76 103 127 136 126 176 138 116 81 55 35 371911 4°7 3 1 3 | 1333 

















Se tee EE HE ES 
“2 SD Cis Lo © 


48 








TABLE 51. 


K. M. ELDERTON 
Glasgow. Height and Weight of Girls 


of Group 


329 
B. 














Weight of Girls of 10°5—11°5 years 

|Height? | | | | Papa Pak Dba ee ba) ee ee ee 

gc se ellie ae age AOR CaS Si ‘S > ry Te ge ee nee aN ge ep A Ay, BS oe, ee A a ee ee 

~ Om) H6e& Dm H S ~~ Dd ~S SrFAanNrHAHeFHSHH He DA HH 

nl 5: SS OS SS OS Ls) IND us) co Ve USO UO Re Re Re RR ® BG Soa HD DH 
8i—|————- 1 —— Orr aa ern err nr sr nr rr Se SS OS OS OC Oe ee ee 1 
36 ,, —_— — lmr rer Fa ar rar er ea ere er eer ere 1 
zy, i-—-- oO Or oer Ll—- —- errr area er ee er er er er er 1 
58 ,, 1l——--—— — — - - Cre er er rr rr err err er err er err er errr errr cr 2 
39 ,, —- 1 ——— — — — — See EE 1 
40 ,, —— 112. 0—— Lm me Oa a awa a a Eee Ee Ee 4 
a 1li————— — - - e-Oerrerrr rare  ereerr er rr e ro ee err ere er 3 
i ,, —-—--—-—1l12e2e—-—- — — oe Orne err er er ee ee ee er er er 6 
438 ,, ——1l— 1 4— 2 QD mE EE 10 
44 5, —1l—— 464 5D — — me a SE EE 22 
45 ,, ——-t!36565¢6¢%77 82 l Lo 37 
4G ,, ——2— 49 91012 11 5 40 3B ee ee 69 
47, 1— —— 1 213121619 13 18 8 68211— — — —  —  — — — — +] 7 | 
ig, |——--—-—-— 7141418 40 2 15 10 8— 1 1T———— 1 —-— —- — — — — 158 
49 ,, ——- 1 — 11 51118 2 33 21 2410 8 6 1 1 — l——— — — 171 
50 ,, — —— 1 2— bt1 18 3% 32 38917 345 S— 1 lee — — 176 
51 ,, —-—— -— — 1 4 16 17 20 241521 78 4<122d801 —-——— — — 144 
6, ~————-—-—-—---— 2 8 12 225 91612 43 4— 11—————— 122 
53 ., — — — — — — — 1 — l 3 3 51310131111 5 27 &@———1l1—— 89 
5h ,, — — — Ol l l ti8s898m:87738322123231—-——— 55 
DE rs eee ee a a oe, ORS ee SS 2 a eee 33 
56 ,, — — SO Ii— — 2@i2a—itiirioss8skgiist— 1—— 17 
es —- —- — — — — —-—- —- — l1-—- l—— 1—— 1 1l gr — — — 1— 8 
58 ,, — — Sl — «-—- —— 1 ire LHe — H— 1 eer 4 
bo, f— — — — eC OWI 1 
62, j——— — — Dm 1 
68, §———— — — — — — — —_ — - le —-— — — 1 
Totals] 2 4 7 5 21 51 52 73 94 133 146 118 138 96 61 65 53 36 2118 2210 6 42131 1 1244 





Glasgow. 


Height and Weight of Girls 





of Group B. 























Weight of Girls of 11°5—12°5 years 
Height ee | | | I | I | | I | Li 4 Ae ee | | | Totals 
Sea Si S$ & SF SRGEHHARES HS SB 3 & + 
= iS » ce) S —) Sore eenmnyDQmDeanana 
ssg—} 1 ———— — — - - ee 1 
Sg eae eas a4 ae TES | A ae Speen yee) Fess 
41, J—— 1—— -— — — — — — — — — 1 
42 ,, ——— 3 — =. = * e se ek ee a aes cere cael ree eee pees ee 2 | 
oes bo b 4 Se a ea ee eS ee — — — — — — — — 7 | 
Sete) OER ok ot pe a SS SS = SS SS Se 8 | 
Oo —— 364 232— [= me 16 
46 ,, —l1l— 3 8 7 ye fey eee a ng arg ee ee gn =e 24 | 
ee lL 11811 12 6 4 1-2 —————-— —— — — — — — — 47 
48 —— 2417 17 18 9 56 1 — 383 1————— — — — — — — 77 
49 1—— 1 5 31 20 20 19 5 {— — —- — — — 106 
50, |—_——— 4 13 39 35 31° 290 .5 1 2 1—--—— — — — — — — | 151 
Sa ea mae eee Yo a. ae Oe. SS ee 4-8 OP oe a ee eee 
62, 1———— 1 5 13 30 43 298 2991212 5 2 2— -— — — — — — 182 
53, J ————— 1 38 14 2 2 262012 4 6 2— ———— — —f 133 
54 5 | — ——— — 4 7 8 19 15 221613 7——— —-——— — — fj lll 
65, J——— 1 1— 2 2 4 8 20121512 42 3 1———1— —— 88 
56, J—— — — — — 2 2° 1 5841375 22 2—11—-—— 55 
et 4 — — — — — — — — ——~X0 1— CaO B 44H GK C1— 6 ZI—— — — — — 32 
08 F— — — — — — — — — — —— 8B B— — 2 8 1—— — — — — 12 
9, J — — — — — — — |1— — —— 2121 4 4—1l1i1—1 19 
eS a ee, ae ee ne er RD nee ee eee 5 
Co Ge ee a eR ee — is eS a pe eee Ea a ee fe ree 3 
oe: Se a eee ee Ea ee ee = 
i aa Ss Ure Placer wie ee, Vets Cd Ee es ee Se eee a 1 
La Serene ee es B26 as Ree ere Re ee wes 
eS eee ee l waste eteab cams," Solow Sats. <a Gee Gu as ae? aaa: ee 1 
69, Jj——— — — ee 1 | 
Totals] 3 4 14 29 54 103 140 162 171 129 131 86 74 59 3618121211 1 3:3 2 — 1 | 1252 | 




















330 


Height and Weight of School Children in Glasgow 
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Height and Weight of School Children in Glasgow 
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NUMERICAL ILLUSTRATIONS OF THE VARIATE 
DIFFERENCE CORRELATION METHOD. 


By BEATRICE M. CAVE anp KARL PEARSON, F.RS. 


In 1904 Miss F. E. Cave in a memoir on the correlation of barometric heights, 
published in the R. 8. Proc. Vol. LXxtv. pp. 407 et seg., endeavoured to get rid of 
seasonal change by correlating first differences of daily readings at two stations. 
A similar method was used by Mr R. H. Hooker in a paper published some time 
later in the Journal of the Royal Statistical Society, Vol. LXVIU. pp. 396 et seq., 
1905. This method was generalised by “Student” in the last number of 
Biometrika (Vl. X. pp. 179, 180). He showed that if there were two variates 
« and y, such that 

a= (t)+X, 
y=f d+ ¥, 
where X and Y are the parts of # and y independent of the time ¢, then the 
spurious correlation arising from # and y being both functions of the time could 
be got rid of by correlating the differences of # and y, and that ultimately, when 
m is sufficiently large : 
Tame Amy = amizamty = Ct. = Txy, 


so that the correlation of # and y, free from the spurious time (or it might be 
position) correlation, i.e. ryy, could be found by correlating the successive differ- 
ences of # and y. When the correlations of the differences remain steady for 
several successive values, then we may reasonably suppose that we have reached 
the correlation 7yy*. 

This method is still further developed by Dr Anderson of Petrograd, who in 
a valuable memoir published in this Jowrnal has provided the probable errors of 
the successive ditference correlations of a system of variables : 

oe Maro oe 

i. ae 

* Having been in communication with ‘‘ Student,” while he was writing his paper, I know that the 
interpretation put by Dr Anderson (Biometrika, Vol. x. p. 279) on ‘*Student’s” words (Ibid. p. 180) is 


incorrect. ‘‘Student” had in mind, if he did not clearly express it, the ultimate steadiness of 
Tamy Amy for a succession of values of m. K. P 
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where the correlations of random pairs of values of the variates, or the product sums, 


S{(X,—- X)(X,-X)}, 
S{((Y,-—Y)(¥,-Y)}, 


are both zero. 


Dr Anderson has further provided us with the values of the standard deviations 
of the successive differences, i.e. 
Trmy 80d Crnys 


which represent the ultimate values of c,,,. and 7 jm,» When we have carried m so 


Amr ny? 


far that the time effect has been eliminated. 


The new method appears to be one of very great importance, and like many 
new methods it has been developed in a co-operative manner, which is a good 
reason for not entitling it by the name of any single contributor. We prefer to 
term it the Variate Difference Correlation Method. 


With the exception of a few illustrations given by “Student,” no numerical 
work on the correlation of the higher differences has yet been attempted. It is 
clear that much numerical work will have to be undertaken before we can feel 
complete confidence in our knowledge of the range and of the limitations of the 
new method. We have yet to ascertain how far in different types of material 
a real stability of difference correlations is ultimately reached, and how far 
various assumptions made in the course of the fundamental demonstration apply 
in dealing practically with actual statistical data. One of the most important 
assumptions made if there be n values of the variates is that arising from the 
reduction in the number of values as we take the means which occur in successive 
differences, and a like assumption is made in the case of standard deviations. 
Thus for exampie: 

12, — 
— S(X,) = X, 
1 


A 
n 





but Fj 


it is assumed to be, unless n be very large. Similar remarks apply to the sums used 


n-1 
"S(X, — X,,,;)=(X,—X») /(n—1), and will not be sensibly zero, although 
1 


= 


. Ki : a ea Ps ees 
in the standard deviations, i.e. we assume in the proof ss S (XZ)= - oh. Gm 
iat 1 1 


—1] 
Ultimately with the mth differences we come in the proof to relations of the 
type 
n—™ 2 1 2 z 
S (X,) =— 8S (X,) 
l nm 


a= 7 


and 
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Now such relations will undoubtedly be very approximately true, if the X’s 


are random variates uncorrelated te each other, and provided m is small compared 
with x. These conditions seem amply satisfied when we proceed to fourth or sixth 
differences in barometric pressures, taken, say, over ten or twelve years; the 
addition of four or five daily pressures will hardly affect sensibly either the mean 
or the standard deviation. But such extensive data, while not only involving a great 
deal of Jabour in the difference work* are not those which, perhaps, most frequently 
demand the attention of the statistician, whether he be economist, sociologist 
or a student of scientific agriculture. In such cases it not infrequently happens 
that the available data only provide a range of 20 to, perhaps, at most 50 years ; 
and we need to discover whether there is a true relationship between our variates, 
apart from a continuous change in both due to the time factor. At present 
accurate statistics of annual trade or revenue, or satisfactory annual demographic 
data hardly éxtend at most beyond a period of 50 years. Very often—under 
even approximately like methods of record—we shall hardly have more than 
twenty years’ trustworthy returns. Not only has the method of record been 
changed, but the conditions of transit and trade may have been immensely 
modified and in a manner which we could not suppose to be even approximately 
represented by a continuous function of the time. 

The object of the present paper is to illustrate the theory of the variate 
difference correlation method in its present stage of development on a short series 
of economic data, in order to test what approximation there is in such short series 
to stability, and further how nearly Dr Anderson’s values for the successive 
standard deviations apply to such cases. We have selected as our data ten 
economic indices of Italian prosperity for the years 1885 to 1912, together with a 
“Synthetic Index,” formed by taking the arithmetic mean of the ten economic 
indices referred to. These eleven indices are given by Professor Georgio Mortara 
in an interesting memoir: “Sintomi statistici delle condizione economiche 
d’ Italia” which was published in the Giornale degli Economisti e Rivista di 
Statistica, for February, 1914, and form Tabella I, of that memoir, which we 
here reproduce in part as Table I. The indices in each case are obtained by 
dividing the returns for any year by the means of the returns for the years 
1901—05, inclusive, and multiplying as usual by 100. 

The indices are for returns of (i) Gross Receipts of Railways, (ii) Shipping, 
loaded and unloaded at the ports, (iii) Effective Revenue of the State, (iv) Inter- 
national Commerce, Value of Imports and Exports, (v) Number of postal 
Letters and private Telegrams, (vi) Amount of Stamp Duties, (vii) Savings Banks’ 
Returns, (viii) Impoi tation of Coal, (ix) Gross returns of consumption of Tobacco, 
(x) Returns of Coffee imported. Professor Mortara has drawn attention to the 
very high correlations of these individual indices with each other, and of each of 
them with the “Synthetic Index.” The latter correlation is, however, to a certain 


* A discussion of the correlations of the higher differences in barometric pressures will we hope be 
shortly issued. 
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extent spurious. For if J,, J,,... J,) be the individual indices and J, the synthetic 
index, then J,= 5 (J, + J,+ ... + Ji) and any individual index I,, if there be no 
correlation between such individuat indices, would give 


1 Y td T 1 Y 2 
a” (Ls — 16) (Iq — 14) = 75, S(1,- I,) 


1 1 T\2 1 2 2 . 
ai a SC, —- [y= 100 (07, 5 oi, a ala + o",,,), 


and accordingly 


TABLE LI. 
Professor Mortara’s Table of Index Values of Italy. 
Numerical Index (Mean 1901—1905) = 100. 





a ® uw. bal 
| >| #1 8 | ss] Fe co 3 5 E 
4 s rt a F=| ee S S ra = 
Year es = : a = 2 5 5 ) a 
e/a | | $8|a3 a a E 
| - a 
1885 61 63 78 72 38 82 17 53 82 98 67°4 
1886 62 63 79 74 40 87 53 52 86 94 69°0 
1887 68 73 82 78 42 94 Dd 64 88 91 73°5 
1888 70 71 83 62 13 98 | 57 69 86 81 72°0 
1889 71 77 85 70 45 99 | 58 71 86 78 74°0 
1890 72 i8 56 66 46 97 | 59 78 87 8] 75°0 | 
1891 72 72 85 60 48 96 | 60 70 88 80 73°1 
1892 71 75 86 64 51 96 64 69 89 80 74°5 
1893 70 70 85 64 55 95 65 66 99 73 73°3 
1894 72 72 86 63 57 93 65 84 89 71 73°2 
1895 73 76 89 66 60 92 68 4 | 88 70 75°9 
1896 75 77 90 67 64 94 70 73 88 73 77°1 
1897 78 80 90 68 68 96 73 76 87 75 79°1 
1898 81 84 91 78 73 96 73 79 89 78 82°4 
1899 86 88 93 88 75 96 79 87 91 82 86°5 
1900 89 89 94 91 78 96 83 88 92 82 88°2 
1901 90 91 96 92 85 96 87 86 95 92 91°0 
1902 96 99 98 95 93 97 92 96 97 94 95°7 
1903 101 103 99 99 103 99 99 99 99 102 100°3 
1904 105 102 101 103 111 | 101 107 105 102 103 104°0 
1905 | 108 | 105 | 105 | 111 108 |107 | 115 | 114 | 106 | 109 108-8 | 
1906 119 123 108 132 107 115 | 127 137 109 118 119°5 
1907 | 126 125 108 144 114 120 144 148 115 125 126°9 
1908 134 129 113 139 122 120 155 150 124 132 131°8 
1909 139 140 121 149 132 123 168 165 131 140 140°8 
1910 146 146 129 159 140 130 180 166 138 147 148°] 
1911 155 156 135 167 149 136 187 171 144 153 155°3 
1912 165 169 138 179 158 141 192 179 151 160 163°2 





| SERSEES, eos Wee S| Sees he 
Mean | 94°8 | 116°3| 97°6 | 96°4 | 82°3 | 103°3| 95°9 | 99°0 | 100°6 | 98°6 96°5 
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which would have a substantial, but spurious value amounting to ‘316 if 
77, =O, = Oy. If there be high correlation between the individual indices, of 


course, the correlation of each individual index with the arithmetic mean or 
synthetic index will also be high. Thus our Table IV shows it to range from ‘952 
in the case of Coffee to 998 in the case of Railways. Possibly a third of this 
correlation may in some cases have a spurious origin. But the individual indices 
are very highly correlated together; only two such correlations are below ‘9, and 
the lower of these two is as high as °885. We are accordingly left with fifty 
correlations ranging from ‘885 to ‘997 between the individual indices, and if we 
accept these as true measures, then it is clear that any one of these ten indices 
might be used as a reasonable index of Italian prosperity; it would for practical 
purposes be idle to calculate them all or to table their arithmetic mean. 


But the high correlations found lay themselves open from our present stand- 
point to some suspicion of being solely due to the fact that during the 28 years 
under consideration Italy has progressively increased in population and ac- 
cordingly the consumption of innumerable goods and the means of interchange 
have all grown together with the time. In other words the correlations we give 
under the heading of “quantities” in each separate section of Table IV are very 
high solely because the individual indices are variates increasing one and all as 
continuous functions of the time. 


The material therefore seems especially suited to the application of the 
variate difference correlation method. For example, the correlation between the 
indices for tobacco and savings is ‘984; are we to interpret this to signify, that, 
if there are large savings this means that much will be spent on tobacco? Or, 
is this high correlation simply in whole or part spurious, merely indicating that 
both savings and consumption of tobacco increased markedly with the time ? 
Actually the correlation of first differences drops from ‘984 to ‘766, that of 
second differences is negative if insensible, while from there onwards it steadily 
increases negatively, till with the sixth differences we reach —*431, which seems 
to indicate that, when time has been eliminated, expenditure on tobacco in any 
year means less money saved. Again the coffee and tobacco indices appear very 
highly correlated, ‘955, but by the third ditference correlation we have reached 
about a third of the relationship, ‘319, which is scarcely altered in the sixth 
difference correlation, *326; we may assume therefore that there is probably a 
moderate ’ 


“organic” relationship between the expenditures on coffee and tobacco, 
but the association is nothing like as close as would be suggested by the corre- 


lation of the raw indices. 

The work has been done in the following manner: The successive differences 
of the indices up to the sixth were found. The means and standard deviations 
of these differences were calculated, and the correlations were then worked out 
in the product-moment manner. This involved the very laborious work of 
determining 385 coefficients, and then to these coefficients were added the 
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probable errors as found by Dr Anderson’s formulae. These probable errors are 
of course those of the correlations of A™X and A”Y, and will not be the correct 
values for the probable errors of the correlations of A™x and A”y, until A™z = A™X 
and A"™y=A"Y, i.e. until m is sufficiently great for t to have been eliminated. 
Further their accuracy depends on the vanishing of the means of the differences 
or on the equalities of the sums like 


l n 
n—1 





pat! 1 ee 
S(X,), —~S(X,), ete. 
1 n—15 


which, while true on the average, will only be approximately true in the actual 
instance if n be large. We give in Table II, the Mean Values of Index 
Differences. 


TABLE IT. 


Mean Values of Indices and their Differences. 














3 ‘7 n = o 
>| # | s | 82) 32 ial aa 8 e-|3 83 
a a eo) s¢ 8.2 gais 3 2 Dn 9H 
=| Be 5 aefig®| $s/¢3)| 8 = S |8°ss 
a ‘3 ® EE | of nA | SA S) S ss 
a n s 25 | a | @ a i ® 33 
| } | | 
} — = i a = = a ae oe | | Rae ee 

Quantity ... | 94:8 | 1163) 97°6 | 96-4 | 82°3 | 103°3 | 95°9 | 99°0 | 100°6| 98°6 | 96°5 
Ist difference |— 3°85 |—3-93 - 222 | - 3:96 |~ 4-44] ~2'19 |— 5:37 | 4°67 —2°56|—2°30! —3°55 
2nd Bi — °35|- 50|/— 08|— °38]/— -27 00}+ 04|/-— ‘35/- ‘12/— °42 ~- 24 
3rd s + ‘16\+ -°28'4+ -20;-— 08} 00/+ ‘12!— 08/+ ‘40|- ‘12 00! + ‘09 
4th re — 33}— ‘88|— 13/-117} ‘O0/— °28/+ 04/- ‘79|- ‘16/-— 42| — °40 
5th id + 57|+2°26'4+ -26)+2°52/+ -30 00 |— 52\/+4+1°87|+ 22/4 87) + ‘83 
x — $2/—2-00 |- 110 


‘55 | —6'41/+ 18/-— °73|— 05|+2°00|— °95)|-1°68 


It will be seen from this table that the means of the differences are far from 
zero even when we have reached a difference for which we may suppose the time 
to have been eliminated. This arises from the smallness of the series dealt with 
and shows us that we ought not to anticipate more than a rough accordance with 
theory, or only an approximate steadiness, for sums like 

u—m™m 
: S (X;) 


n—™M 4 
may grow less and less steady as m increases. 


Similar considerations apply to the standard deviations of the differences. 
These will not at first obey Dr Anderson’s formulae because they are the values 
for S ame and T my? and when we have taken m sufficiently high for 


Fm, and T Amy to be theoretically equal to o,,. and oamy> 





m 
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and accordingly the correlation should have begun to be steady, then some failure 
to obey Dr Anderson’s formulae will arise, because the means of the differences 
are not truly zero and equalities of the type 


1 22). 1 n—1 s. 
— S (XZ)= —\ S (X%4,), ete. 
—1 1 n—1l 1 


n 


will not be satisfied when n is relatively small. 


Dr Anderson gives the value of o* kmx in terms of oy, but we do not of course 


know o?,, which will be very different from o?,, and can only practically be found 


from the value of o? itself, after that value has become equal to o° Le. after 


A™x A™xX? 
steadiness has set in. In order therefore to test the formulae we have formed the 
ratio of 


Orne O nm-t from m=1 to 6. 


9 


This equals 4—— in Dr Anderson’s formulae for O* snx/O* am-ix and therefore 
m , 2 


we have a good measure of the approach of A™z to A™X, or of the growth of 
steadiness as apart from the correlations. The following Table IIs gives the 
values of the ratios of the squares of the standard deviations, theoretical values, 
actual values and the mean value for each of the differences of the ten 
individual indices. 


TABLE III. 


bo 


Values of GO ym | OF? Am—1z and their approach to 4— 





m 

| | } | Mean of 10 
w a © ‘ ae ° Index 
, =) 3 | 20 on 3) 4 . a 

ag | 3 ‘B = a | s 3 3 = Difference 

aS | & = a 2 3) < S Standard 

| wo | a a=} a = Deviation 
| } Ratios 
ras pee | | | | | 
12 | O12} 012) ‘031 ‘O19 “O38 “009 “040 | “O10 "035 "022 ‘036 “025 
| 3 “705 ‘708 | 1°834 *763 | 1°720 *799 *D85 *350 | 2°074 *352 | °843 1°003 
3°333 | 3°107 | 2°816 | 3°093 | 2-124 | 3°032 | 1°959 | 1°660 | 2-214 | 3°075 | 2°213 | 3°307|  2°549 
3°5 3°167 | 3°128 | 3°174 | 2°747 | 3°213 | 2°597 | 2°008 | 3°106 | 3°379 | 3°025 | 3°619|  3°000 
3°6 3°143 | 3°449 | 3°189 | 3°020 | 3°104 | 3°010 | 2°328 | 3°275 | 3°580 | 3°117 | 3°701 | 9 3°177 

3°164 | 2 


3°667 | 3°149 3°711 | 3°195 | 3° 


2°881 | 3°208 | 2°499 | 3°455 | 3°682 | 3°101 | 3°791 | 3°269 


It will be clear that until we reach the ratio of the square standard deviations 
of the third and second differences, there is no general approach to steadiness. 
After m= 8, however, for m= 4, 5 and 6, the ratio of the values for the mean of the 
series of individual indices to the theoretical value is ‘86, ‘88 and ‘89, respectively. 

Thus, there is increasing approach to agreement in the observed and theoretical 
values, but the approach is slow, and we believe that there is greater steadiness than 
is really indicated by this test. The source of this apparent unsteadiness lies we 
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think in the relative largeness of m compared with n (i.e. at a maximum 6 as 
compared with 28), rather than in our not having taken sufficiently high 
differences. 


We now turn to the correlations. These are given in Table IV, the actual 
values of the standard deviations of the quantities and their differences being 
recorded along the diagonal cells, while the other cells contain the correlations of 
each pair of variates and of their successive differences. 

We will now consider these correlations in detail. 

(a) Synthetic Index (Arithmetic Mean) with Individual Indices. 

We see at once that the Synthetic Index is highly associated with Shipping 
(>°85), with Importation of Coal (? >°75), and International Commerce (? < *68), 
and fairly highly with Revenue (c. ‘55). On the other hand the sixth difference 
correlations with Post (c. °15), with Stamp Duties (c. ‘24) and with Savings (> *23) 
are all such (i) that they might well have arisen from the spurious element in the 
Synthetic Index correlations, and are all less than their Andersonian (steady 
value) probable errors. Almost the same may be said of the Railway Index ; it 
is not beyond suspicion of being spurious, and is scarcely significant having regard 
to its probable error. The Consumption of Coffee is also not very closely associated 
with the Synthetic Index ; it is only about twice its probable error (427 +°205), 
and a good deal of its value may be spurious. Further in the case of both 
Coffee and Railways, the correlations are still falling between ‘04 and ‘05 for each 
difference. The last individual index remaining is that for Conswmption of 
Tobacco and although the correlation of sixth differences is not really significant 
it is negative (— ‘247 + °235), and is exhibiting a steady negative rise. 

Stripped therefore of the common time factor the Synthetic Index will be seen 
to be no very appropriate measure of trade, business activity, and spare money for 
savings and luxuries. With Post, Siwmp Duties and Savings, it has probably only 
a spurious relationship, expenditure on railways has little influence, that on 
luxuries is very slightly significant, or indeed in the case of tobacco negative. It 
is, however, closely related to variations in external trade, i.e. imports (including 
coal), to exports and shipping and to effective revenue. It appears to us that a 
suitable general index of prosperity, which will distinguish between a continuous 
growth in all factors with the time, and favourable and unfavourable fluctuations 
from this growth, can only be obtained, when there has been far more ample 
study of the associations of individual indices among themselves, and of these 
indices after they have been freed from the time factor, ie. of associations 
between high difference correlations. From this standpoint the study of General 
Index theories is at present in its infancy. 


(b) Railway Index. This index is very noteworthy in the nature of its 
associations after removal of the time factor. We have reached a steady 
correlation (c. 62) with Shipping, but beyond this no values of first class im- 


portance appear. The relation of Railways and Revenue after falling practically 
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350 Illustrations of the Variate Difference Correlation Method 


to zero, now stands at something greater than “42 and might rise higher, but 
the relation to International Commerce as a whole is zero, which suggests that 
the goods imported and exported are not in the bulk carried by rail. Further 
although the final value of the Railway and Post correlation is scarcely sensible 
(— ‘214 + 239), it has been continuously negative from the second difference, and 
thus suggests that increased expenditure on the post means lessened profit for 
the railways. This might be interpreted in two ways: (i) that business con- 
ducted by post or telegram lessens rail intercommunication by person, or (ii) that 
in the case of state-railways, there is not an increased profit to the railways from 
carrying larger mails. But still more remarkable are the negative correlations of 
Stamp Duties, Savings, Tobacco and Coffee with Railways; none of them are very 
large, and all but savings, perhaps, of the order of their probable errors. But 
taken as a whole they suggest that when the Italian spends little money in going 
about, then he saves more, or spends more on such luxuries as tobacco and coffee. 
Lastly we have the Coal Index. It might be supposed that a year with great 
coal importation would signify great railway activity, and this-is the judgment 
which would be made from the raw correlations of these variates. But the actual 
facts are exhibited in a correlation still falling at the sixth difference and hardly 
significant having regard to its probable error. The inferences formed must be: 
(i) that imported coal is used largely at the ports of disembarcation or travels 
inland by other than railway transit, (11) that the imported coal is largely used on 
the railways themselves and that its cost is a heavy tax on their resources. 

(c) Shipping Index. As we might anticipate this is highly correlated with 
(i) Railways (c. *62), (ii) Revenue (c. ‘75) and less highly but very significantly with 
(iii) International Commerce (c. 54) and (iv) Coal (c. 58), but it appears to have 
no relation whatever with Post, Stamp-Duties and Savings, and when we come to 
luxuries, their importation is clearly not a factor of shipping prosperity. Neither 
in the case of Tobacco nor of Coffee are the correlations really significant ; with 
the former we have an increasing negative correlation and with the latter a 
decreasing positive one already below its probable error. Thus we see that 
neither directly by bulk of importation nor indirectly by immediate increase of 
consumption, does a rise of shipping mark significant rises in the use of luxuries 
such as J'obacco and Coffee. It would be of interest to ascertain whether in- 
creased consumption of luxuries does not rather follow than accompany favourable 
trade fluctuations. 

(d) Revenue Index. This index as we might expect is fairly highly cor- 
related with Shipping (c. ‘75). It has relatively small relation to Railways 
(422 +206) at least at the sixth difference and a somewhat similar value 
(c. 42 + 20) for Coffee. Thus the suggestions arise that revenue is but little 
produced by the railways and that coffee is not a very large factor of the custom 
dues. It is astonishing to find, however, that Post, Stamps and Savings have 
negative correlations with Revenue of — ‘388 +°213, —°255 + 234 and —-154 + -244 
respectively, which, if scarcely significant, have been in each case for several 
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differences persistent in sign. Even the correlation with Tobacco is small, falling 
and insignificant (< ‘115 + °247), and that with Coal which might be supposed to 
be high as marking good trade times is hardly significant although apparently 
rising (? > "270+ 232). Lastly the correlation of Revenue with International 
Commerce is again small, falling, and insignificant (< 214 +°239). Thus Revenue 
or the “entrato effectivo dello stato’ 


> 


seems to provide an index which has little 
valuable relation to any other characteristic of prosperity beyond shipping. 


(e) International Commerce. Here we find no single final individual index 
correlation greater than 54, which is that for Shipping. The next most important 
correlations are with Post (>°47) and Stamp Duty (c. -46). With Railways the 
correlation is zero, and with Revenue also falling and insignificant. With Savings, 
Coal, Tobacco and Coffee the correlations are all insignificant ; in fact in the last 
three cases not only are the values less than their probable errors, but they are 
still falling. It is thus clear that in Italy the total of Exports and Imports is 
no measure of all-round prosperity, they do not immediately increase either 
savings or the consumption of luxuries. 


(f) Post and Telegrams. Here we have the lowest series of correlations we 
have yet reached. Post values have no significant relation to fluctuations in 
Railway (c. — ‘20 + °24), to Shipping (—°059 + 249), Stamp Duties (—-027 + 250), 
Coal (— °050 + °250), Tobacco (+ °108 + °247) or Coffee (— ‘133 + -246) Indices. It 
is significantly correlated only with International Commerce (>+°47 +°19) and, 
perhaps, significantly with Savings (+ ‘336 +°222) but negatively with Revenue 
(c. — 38 +'21). In short the number of letters and telegrams in Italy is hardly a 
mark of any other favourable fluctuation in prosperity, beyond International 


Commerce. 


(g) Stamp Duties. This Index is correlated positively and significantly with 
International Commerce (c. + *46 +°20) and positively, and doubtfully with Savings 
(c. + °35 4°22). It is correlated insignificantly and negatively with Railways 
(—'261 +°233), Sh upping (— 009 + :250), Revenue (—'255 +°234), Post (—-027 +250), 
and Tobacco (—‘129 +246); it is correlated positively and insignificantly with 
Coal (+ 052 + 250) and Coffee (+ ‘222 + °238). Thus again freed from continuous 
time changes, fluctuations in the Stamp Duty Index are of small value as a 


measure of contemp yranecus general prospe rity. 


(h) Savings Bank Index. There are practically only two correlations of any 
importance with Savings and these are both negative, namely those with Railways 
(—°431 + 204) and with Yobacco (—°431+°204). Hence it would appear that 
when the Italian people is in a saving mood, it spares on transit by rail and on 
the consumption of tobacco, and when it expends on these luxuries, then it does 
not save. Savings have small and possibly not significant correlations with Post 
(>+°33 +°22) and Stamp Duties (< +°353 + 219), and insignificant and positive 
correlations with International Commerce (>+ ‘27 + °28), Coal (> +19 + 24) 
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and Coffee (c.+°05 4°25); they have insignificant negative correlations with 
Shipping (< — 03 +°25) and Revenue (< — ‘15 + ‘24). 





Savings are thus—apart from continual time change—no very satisfactory 
measure of general prosperity, and a fluctuating increase is usually accompanied 


by a reduction of luxuries. 


(t) Coal Index. The importation of coal has little relation to any factor of 
prosperity besides Shipping (c.+°58+°17). With Railways the correlation is 
not quite double the probable error and the value, even at the sixth difference, 
appears still falling. The correlation with Revenue only just exceeds the probable 
error (+ ‘270 +232). With International Commerce (+ ‘171 + '243), Stamp Duties 
(+ 052 +250), Savings (+ ‘196 + °241) and Coffee (+ °152 + °245) the correlations 
are less than their probable errors, small and in some cases still falling. With 
the Postal Index, the correlation is negative, insignificant and falling. Alone in 
the case of the Tobacco Index does the correlation appear to be nearly as 
significant as in that of Shivping, but it is negative and increasing* (—‘514 + ‘184), 
while in the case of Shipping it was steady. It is singular to find that Coal, the 
increased import of which should mark increased industrial activity, is, beyond 
the naturally influenced Shipping, alone effectively associated with the con- 
sumption of Tobacco. 


(j) Tobacco Index. This is of considerable interest as marking the association 
of indices of trade prosperity with the consumption of a luxury. With four 
exceptions J’obacco is negatively correlated, although often insignificantly, with 
the other indices. Revenue (+°115 + ‘247), International Commerce (+°015 + ‘250), 
and Post (+ 108 +°247) are all positive, insignificant, and in the first two cases 
still falling. The correlation with Coffee is positive and might, perhaps, be 
significant (+°326 + °224), but it appears to be still falling. With Coal and 
Savings there are probably significant negative correlations (—°514+°184, and 
— ‘431 +°204 respectively); with Railways (— ‘243 + °236), Shipping (—-271 + °229) 
and Stamp Duties (— ‘129 + ‘246) there are insignificant negative correlations, but 
they tend to confirm each other in sign. Thus we see that the consumption of 
tobacco can hardly be considered as a measure of general prosperity ; it appears 
to be greatest when trade conditions are unfavourable, and in particular when 
savings are least and manufacturing conditions as measured by the importation of 
coal are slack. The result suggests the pipe of the unemployed at the street 
corner, rather than the increased expenditure of the fully occupied artisan. 


(k) Coffee Index. This is another luxury and the results are very similar. 
There appears a significant correlation with Revenue (+°400 + ‘210), which might 
easily be explained, and there is a falling but possibly significant correlation 
with Tobacco (+°326+°224). With all other indices the relationships are 


* It is, perhaps, hard to believe that so much smuggling could be carried on in colliers, that it would 
seriously affect the profits of the tobacco monopoly ! 
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insignificant. Railways (—*204 + °240), Shipping (+°232 +:237), International 
Commerce (+142 + °245), Post (—*133 +246), Stamp Duties (+°222 +238), 
Savings (+ °046 + °250) and Coal (4+°152+4°245). Apart therefore from the 
general increase of consumption with the time, during which time the general 
prosperity of the nation has increased, it would not appear that the consumption 
of a luxury has any organic relationship to prosperity. We do not find that a 
favourable trade fluctuation is associated with increased consumption of luxuries. 
In fact the suggestion arises that in the case of tobacco the consumption may be 
greater in a period of depression. 


Conclusions. While we lay no special stress on any of the results suggested 
by the difference correlations above studied—far more intimate economic know- 
ledge of Italian affairs and methods of measurement would be requisite—we yet 
venture to insist on one or two general considerations. 


The very superficial statements, so frequently met with, that such and such 
variates, both changing rapidly with the time,-are essentially causative will 
doubtless cease to have any scientific currency, directly the method of variate 
differences is fully appreciated. We shall no longer assert that the fall of 
the phthisis death-rate can be off-hand causatively associated with the con- 
temporaneous rise in the number of persons dying in institutions, or that the 
increased expenditure on luxuries is necessarily a measure of increased national 
prosperity. 


If we turn as in the present paper to the actual correlations of the indices 
themselves, we find in every case an arid and scarcely undulating waste of high 
correlation. No one can obtain any nourishment whatever from the statement 
that the Tobacco Index is correlated with the Revenue Index to the amount of *983 
and with the Savings Bank Index to the extent of ‘984! The organic relationship 
between these variates is wholly obscured by the continuous increase of all three of 
them with the time. But when we proceed to sixth differences and see that the 
consumption of tobacco has little, if any, relation to revenue, and is associated 
substantially but negatively with savings, we seem to touch realities, and realities 
of some worth. Again what can we learn, if we are told that the Shipping Indez is 
correlated to the extent of ‘99 with both the Revenue and the Savings Bank Indices? 
We might imagine, that increase of shipping was not only the primary cause of 
increase in Italian revenue, but also the essential origin of any increase in the Italian 
peasant’s and artisan’s savings! An appeal to the variate difference method shows 
how fallacious such imaginings would be! An examination of the sixth difference 
correlations shows that while prosperity of the revenue is closely associated with 
trade as measured by shipping (‘77), the correlation is not nearly perfect; on 
the other hand there appears to be no significant organic correlation at all 
(— ‘154 + 244) between the prosperity of the revenue and the savings of the 


Italian populace. As we have noted a knowledge of local conditions and methods 
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of reckoning quantities might enable us to put other and, perhaps, more luminous 
interpretations on our results. But there can be small doubt that to proceed from 
the actual correlation of such indices to the correlations of their higher differences 
gives the feeling of clearing away the sand of the desert, and reaching all the 
ordered arrangements of an excavated town below; the slight undulations of the 
waste above are really fallacious, and enable us to appreciate nothing of the 
actual topography of the city. 


The method is at present in its infancy, but it gives hope of greater results 
than almost any recent development of statistics, for there has been no source 
more fruitful of fallacious statistical argument than the common influence of the 
time factor. One sees at once how the method may be applied to growth problems 
in man and in lower forms of life with a view to measuring common extraneous 
influences, to a whole variety of economic and medical problems obscured by the 
influences of the national growth factor, and to a great range of questions in 
social affairs where contemporaneous change of the community in innumerable 
factors has been interpreted as a causative nexus, or society assumed to be at 
least an organic whole; the flowers in a meadow would undoubtedly exhibit 
highly correlated development, but it is not a measure of mutual causation, and 
the development of various social factors has to be freed from the time effect, 
before we can really appreciate their organic relationships. 


In the present paper we have dealt only with very sparse “ populations” (only 
28 values of the variates), but this has enabled us to consider not only a very 
large number of correlations, but to see the practical influence of terminal con- 
ditions on our theory. This may we think be summed up in the statement that 
the Andersonian formulae for the standard deviations will hardly in many practical 
cases be more than very roughly approximated before the size of the population 
becomes too small to make the deductions reliable. Further in most cases our 
difference correlations have hardly even with the sixth differences reached a steady 
state. Possibly they have done so in the cases of Rail and Shipping, Shipping 
and Post, Shipping and Coal, Revenue and Post, International Commerce and 
Stamp Duties, International Commerce and Savings, Savings and Coffee, and in 
one or other additional pair. But in the great bulk of instances there is still a 
more or less steady rising or falling appreciable in the difference correlations, and 
all we can really say is that the final value, the true ryy, will be somewhat 
greater or less than a given number. From an examination of the actual 
numerical working of the correlations, it appears to us that the terminal values 
are in the case of these short series of very great importance. It is further clear 
that the theory as given by “Student” depends upon certain equalities which 
are not fulfilled in practice in short series. We await with much interest the 
complete publication of Dr Anderson’s work, and hope to find a fuller discussion of 
the allowance to be made in short series for the influence of the terminal state of 





Beatrick M. Cave anp Kart PEARSON 355 


affairs* on the steadiness of the series and on the approach to the standard- 
deviation formulae. But apart from these lesser points, our present numerical 
investigation has convinced us of the very great value of the new method of 
Variate Difference Correlations. 


* For example if we measure X from its mean, 
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relations of the standard deviation formulae for the higher differences need not hold for any individual 
case, and thus the steadiness of the difference correlation series, and the approach to the Andersonian 


formulae are very far from attained. 
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EXAMINATION OF SOME RECENT STUDIES OF THE 
INHERITANCE FACTOR IN INSANITY 


By DAVID HERON, D.S8c.* 


In the last few years a number of studies of the inheritance factor in insanity 
have been published in America, Germany and England. The value of investigation 
of such a topic cannot be overestimated. We are quite certain that the prevalence 
of insanity is not falling; many of us indeed believe that the statistics suffice to 
demonstrate that it is substantially increasing, and that we can attribute this 
increase not in the first place to the intenser strain of modern life, but to the 
greater power of modern treatment to check or temporarily cure attack, and thus 
allow wider possibility of reproduction to members of affected stocks. Indeed the 
problem seems closely associated with an essential difficulty of modern civilisation, 
the greater protection of physically and mentally degenerate stocks unaccompanied 
by any adequate limitation of their thereby increased power of procreation; the 
inheritance factor thus tends to aid the relatively greater survival of the socially 
unfit. The studies we have referred to would be of great importance from this 
aspect of eugenics if (i) the data were collected without conscious or unconscious 
bias, and (ii) the inferences drawn from them followed logically from the data thus 
collected. 

Unfortunately it is not only in the interpretation of statistics that adequate 
training is required. It is equally important that in the actual collection of them 
we should . proceed, not only free from the bias which arises from the hurried 
acceptance of dogmatic theories of heredity, but what is often still more needful, 
free from the bias which is almost certain to waylay our progress, if we have not 
initially considered with trained insight the fallacies which may result from our 
method of recording or even tabulating our material. The day of the amateur in 
science is gone; no one now pays any aitention to men who propound elaborate 
atomic theories or stellar hypotheses, without having had preliminary training in 
physical or astronomical science. There are still, however, some who appear 
willing to accept the statement of statistical data or the inferences drawn from those 


* This paper formed the second portion of a lecture given at the Galton Laboratory on March 3, 1914. 
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data by men who have clearly had no adequate training in statistical science. The 
craniologist, the anthropologist, even the biological student of heredity and evolu- 
tion are recognising that a statistical training is needful for the true interpretation 
of many of the facts in their special fields of research. The physiologist still 
appears to believe that he can deal with the average effects of diverse dietaries or 
the pathologist with the “ mass-phenomena” of the hereditary factor in insanity 
without any training in statistical method. A physicist might just as logically 
assume that without mathematical training he could give an adequate mathe- 
matical account of a physical phenomenon, or a cosmic theorist suppose that he 
was effectively furnished for astronomical research by the perusal of a popular 
primer on the stars! The statistical calculus cannot be mastered by any easier 
road than the differential calculus, or, to put a more apt illustration, statistical 
training is as needful a preliminary to the handling of statistics, as time spent 
in a physiological laboratory to the effective handling of tissues. In twenty years 
it will be unnecessary to insist on these points, they will be universally recognised 
in the courts of science; but at present it is not only necessary to reiterate 
unpleasant truths, but to emphasise their validity by illustrations which bring 
home forcibly to scientist and layman alike the danger of amateur statistical 
handling. To state that a man is in error is not sufficient, if he continues time 
after time to repeat his assertions, apparently under the belief that incessant 
repetition will convince the world of the value of his theories. 


In the case of the inheritance factor in insanity we are not dealing with any 
purely academic question of science. We are up against one of the most difficult 
problems of modern life, where true advice is of urgent importance to the nation 
as well as to the individual. It is not only the medical man but the layman who 
seeks guidance in the question of the marriage of members of insane stocks, and a 
laboratory like the Galton Laboratory knows how often advice on such points is 
sought. It is disheartening when help is rendered to the seeker to be faced with 
the criticism: “ But Professor ———— says I may marry if I take a wife of sound 
stock,” or “ Dr ———— recommends marriage, although my father was insane, because 
I am over twenty-five and still sane myself.” When teaching of this kind, arising 
solely from false interpretation of defective data, is spread widecast in a dozen 
different papers or journals, it is not sufficient to issue a brief statement of its 
futility. It is needful to give it the coup de grdce by a more lengthy criticism of 
its fallacies and their illustration in a form more likely to impress the imagination. 
The attempt is made in this paper to deal with only one of the authors, who have 
contributed fallacious eugenic rules to those seeking knowledge on the influence of 
the hereditary factor in insanity. 


In a long series of papers Dr F. W. Mott, Pathologist to the London County 
Asylums, has stated that when the children of insane parents become insane, they 
do so at a much earlier age than did their parents, and on the basis of this assertion 
he has drawn some very sweeping conclusions for practical conduct. Thus in the 
Biometrika x 46 
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British Medical Journal of May 11, 1912 (p. 1060), he states that “this signal 
tendency of insane offspring to suffer with a more intense form of the disease and 
at an early age, as shown in the above figures and tables, is of great importance 
for the following reasons: first, it is one of Nature’s methods of ending or mending 
a degenerate stock; secondly, it is of importance to the physician, for he can say 
that there is a diminishing risk of the child of an insane parent becoming insane 
after he has passed 25, a matter of great importance in the question of marriage ; 
thirdly, it is of importance in connection with the subject of social surgery of the 
insane, for when the first attack of insanity occurs in the parent the children for 
the most part have all been born....Sterilization would therefore be applicable to 
relatively few parents admitted to asylums.” 


Put briefly, Dr Mott’s views are that in “Antedating” or “ Anticipation,” in 
this alleged tendency of the offspring to become insane at any earlier age than 
their parents, we have Nature’s method of purifying degenerate stocks, that the 
children of insane parents who are still normal at the age of 25 may safely marry*, 
and that it is useless to take any special measures to limit the reproduction of the 
insane since nearly all their children are born before the onset of insanity. 


These conclusions, if proved to be correct, would be of the utmost importance 
to the Eugenist. If the Law of Antedating or Anticipation really acts in the 
way Dr Mott has suggested, then it would seem to be unnecessary to take any 
special Eugenic action in the case of the insane and indeed the “ Law” has already 
been used in support of this view. Thus in a leading article in the British Medical 
Journal+, which deals with Dr Mott’s work, it is stated that “This intensification 
of mental disease in the young—this ‘ anticipation’ as it is called, which is one of 
Nature’s methods of ending or mending a degenerate stock, is specially important 
in connection with sterilization, as the figures given by Dr Mott show that when 
the first attack of insanity occurs in the parent the children have for the most part 
all been born. Sterilization, therefore, would be applicable in relatively few 
cases.” 

It is at least obvious that when views such as these are taken of the “ Law of 
Anticipation,” it merits the most careful examination. Let us consider, then, first 
of all, Dr Mott’s presentation of the case for anticipation. For some years past 
Dr Mott has been engaged in the collection of cases in which two or more members 
of a family are or have been resident in London County Asylums, and has noted 
wherever possible the age of onset of the insanity. Information was thus obtained 
regarding 217 pairs of father and offspring, and 291 pairs of mother and offspring 
and the results are summed up in the following table. 

Thus in comparing the age at onset of insanity in father and offspring, we find 
that among the fathers only 1’£°/, became insane before the age of 20, while among 
the offspring the percentage was 26-2. These figures are also shown graphically in 
* See for instance Problems in Eugenics, p. 426. 

+ May 11, 1912, p. 1089. 
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Figs. 1 and 2*. Here the horizontal scale represents the age of onset in 5-year 
groups—the vertical scale the percentages of cases occurring in each age group. 


* TABLE I. 


Percentages of Cases whose First Attack of Insanity occurred 
within Various Age-periods. 














Age-periods Father | Offspring Mother | Offspring 
Pehl 
| | 
| Percent.| Per cent. | Per cent, | Per cent. 
Under 20 years | 1°4 26°2 06 {| 378 
20—24 years... , 8° 3° | 5°7 
25-29 14 | 180 | 44 | Ie-2 } Adolescence | 
| 80-84 ,, 9°6 13°0 7°8 | 13°4 
= 11°5 73 92 | 10°0 
yt oe 9-2 64 10°3 | 5:8 | 
| 45-49 ,, 14°3 6:0 12°0 | 3°7) Involutional 
| 50-54 ,, 17°5 0-9 123 | 24f period | 
55—59 13°8 3°7 140 | 17 
60-64 > 10°1 ~ 16 | 1: 
65-—69 = 5:0 — 8°8 — 
a ae | 46 0-4 i 
| | 0-4 —" | 18 — 
80 ee 0-4 | — 0°6 -— 


I have been obliged to follow Dr Mott in treating the “ander 20” group as a 
5-year group as otherwise my diagrams would bear no resemblance to his, but this 
procedure is far from satisfactory when such a large proportion of the cases in this 
group are congenital cases in which the age of onset should be taken at 0 years. 
The tables and diagrams show that among the parents more than half the cases 
occur after the age of 50, while among the offspring, more than half occur before 
30, and this is taken to prove that there is Anticipation or Antedating in 
Insanity. 


This will perhaps be made more evident if the percentages of those who became 
insane before the age of 25 are given in each case. Among the fathers, 2°/, and 
among the mothers, 4°/, became insane before the age of 25. Among the off- 
spring, on the other hand, the percentage is 44. Another way of looking at the 
matter is to take the average age of onset of insanity in each case. Dr Mott 
gives a Table showing these averages but unfortunately has omitted the congenital 
cases so that the extent of anticipation is considerably under-estimated, and the 
form in which the data are given does not permit of an accurate calculation of the 
actual averages. From the information given it appears, however, that the average 
age at onset of insanity among the parents is about 50 years, among the offspring 
about 26 years, showing an anticipation or antedating of some 24 years. 


* IT am very grateful to Miss H. Gertrude Jones, the Hon. Secretary of the Galton Laboratory, for 
the diagrams which illustrate this lecture. 
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Now these conclusions, if satisfactorily demonstrated, would obviously be of 
the highest importance, but they were immediately challenged by Professor Karl 
Pearson in a letter which appeared in Nature of November 21, 1912 (p. 334). 
Professor Pearson’s letter is as follows: 


On an Apparent Fallacy in the Statistical Treatment of “ Antedating” in 
the Inheritance of Pathological Conditions. 


The problem of the antedating of family diseases is one of very great interest, and is likely 
to be more studied in the near future than ever it has been in the past. The idea of antedating, 
ie. the appearance of an hereditary disease at an earlier age in the offspring than in the parent, 
has been referred to by Darwin and has no doubt been considered by others before him. Quite 
recently, studying the subject on insanity, Dr F. W. Mott speaks of antedating or anticipation 
as “Nature’s method of eliminating unsound elements in a stock” (“Problems in Eugenics,” 
papers communicated to the First International Eugenics Congress, 1912, p. 426). 


I am unable to follow Dr Mott’s proof of the case for antedating in insanity. It appears to 


me to depend upon a statistical fallacy, but this apparent fallacy may not be real, and I should 
like more light on the matter. This is peculiarly desirable, because I understand further 
evidence in favour of antedating is soon forthcoming for other diseases, and will follow much the 
same lines of reasoning. Let us consider the whole of one generation of affected persons at any 
time in the community, and let 2, represent the number who develop the disease at age s, 
then the generation is represented by 


Ngy Nyy Nay oe Mgyoee 4905 SAY- 


Possibly some of these groups will not appear at all, but that is of little importance for our 
present purpose. 


Let us make the assumptions (1) that there is no antedating at all; (2) that there is no 
inheritance of age of onset; thus each individual reproduces the population of the affected 
reduced in the ratio of p to 1. Then the family of any affected person, whatever the age at 
which he developed the disease, would represent on the average the distribution 


Pm, PM, PNea,--- PMs, +--+ P™in- 
The sum of such families would give precisely the age distribution at onset of the preceding 
generation. 


Now let us suppose that for any reason certain of the groups of the first generation do not 
produce offspring at all, or only in reduced numbers. Say that g, only of the n, are able 
to reproduce their kind; then of the older generation, limited to parents, the distribution 
will be 

Qo MOF G1 M1 + GoNot --- FY gNgt + Y100%1005 
but the younger generation will be 
P (GoM G1 Foot... HYs Met +++ +J100%100) (Ro +... Ng +... + M00); 
i.e. the relative proportions will remain absolutely the same. 


The average age at onset and the frequency distribution of the older generation, that of the 
parents, will be entirely different from that of the offspring and will depend wholly on what values 
we give to the q’s. If frequency curves be formed of the two generations they will differ 
substantially from each other. This difference is not a result or a demonstration of any 
physiological principle of antedating but is solely due to the fact that those who develop the 
disease at different ages are not equally likely to marry and become parents. 
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A quite striking instance of the fallacy, if it be such, would be to consider the antedating of 
“violent deaths.” Fully a quarter of such deaths in males, nearly a half in females, occur before 
the age of twenty years. Consider now the parents and offspring who die from violent deaths ; 
clearly there would be no representative of death from violence under twenty in the parent 
generation, and we should have a most marked case of antedating, because the offspring 
generation would contain all the infant deaths from violence. 

In the case of insanity, is the man or woman who develops insanity at an early age as likely 
to become a parent as one who develops it at a later age? I think there is no doubt as to the 
answer to be given ; those who become insane before twenty-five, even if they recover, are far 
less likely to become parents than those who become insane at late ages—many, indeed, of them 
considering the high death-rate of the insane, will die before they could become parents of large 
families. Now Dr Mott took 508 pairs of parents and offspring, “collected from the records of 
464 insane parents whose 500 insane offspring had also been resident in the County Council 
Asylums,” and ascertained the age of first attack. As at present advised, it seems to me that 
his data must indicate a most marked antedating of disease in the offspring, but an antedating 
which is wholly spurious. There is, I think, a further grievous fallacy involved in this method 
of considering the problem, but before discussing that I should like to see if my criticism of this 
method of approaching the problem of antedating can be met. 

KARL PEARSON. 
Biometric LABORATORY, 
University CoLLteGcr, Lonpon, 
November 11, 1912. 


Dr Mott has referred to this letter in his Report for 1912*, but it will be more 
convenient to deal with his reply after we have examined the method by which his 
data have been collected and the use made of the data. Let us consider first of all 
how the data were obtained. Dr Mott in describing his material says that it 
consists of a collection of cases in the London County Asylums where two or more 
persons are related to one another. Thus Dr Mott has dealt—not with a series of 
complete pedigrees in which every member is included, whether insane or normal, 
but with a series of cases in which two or more members of a family are known to 
have been in London County Asylums. No notice is taken of those who are 
normal throughout their lives and no allowance is made for those who are normal 
at the time the record is made but who may afterwards become insane. 

Do cases selected in this way provide a complete or impartial view of the facts? 
Some of Dr Mott’s own comments on his data throw a considerable amount of 
light on this point. In his Report for 1909+ he says: “From all the Asylums 
I have received valuable reports, but in the case of the older asylums it has been a 
matter of the utmost difficulty to trace the records of so many years back,” and in 
his Report for 1910t he says, ‘Some of the asylum authorities have gone through 
their case books for a number of years back, but the results have not been 
satisfactory owing to the difficulty of obtaining particulars without a living repre- 
sentative of the family being resident in the asylum—for instance, 110 old cases 

* Annual Report of the London County Council for 1912, Vol. 1. p. 62. 


+ Twentieth Annual Report of the Asylums Committee of the L.C.C., p. 90. 
t Twenty-first Report of the Asylums Committee of the L.C.C., p. 94. 
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reported from Bexley have been rejected as the relatives in the other London 
County Asylums could not be traced, for no instance has been included unless full 
particulars could be obtained.” 


It is thus clear that not all the cases could be traced and that there was 
special difficulty in tracing the older cases. What is the effect of a selection 
of this kind? A study of the following hypothetical cases may serve to throw 
some light on this point. 

TABLE II. 


Anticipation or Antedating in Insanity. Hypothetical Examples to 


| First Example | Second Example 





| 
| Mother: Born Mane aaceroteneg tty (yt 1873 1833 | 
OS SE BOE DRT a gee 1893 1853 | 
3ecame Insane and admitted to Asylum | 1913 | 1873 
Age at First Attack ye Hes eel 40 40 
Died iS orn pre ied oes 1914 | 1874 
Son: Born... ae a Se wat es 1894 | 1854 
Became Insane ... ie was | 1894* 1914 
Admitted to Asylum... oa ee 1914 | 1914 
Age at First Attack... aes Red: 0 | 60 | 





The mothers in those two examples have exactly parallel careers. In each 
case the mother became insane at the age of 40 and only lived one year in the 
asylum. In the first case the son was a congenital idiot but was only admitted to 
an asylum at the age of 20. The age of onset in this case is taken at 0 years and 
the case shows marked “anticipation.” In the second case the mother also became 
insane at the age of 40, the son not till the age of 60, 40 years after his mother’s 
death. The second example thus tells against the Law of Anticipation. Are 
these two cases equally likely to appear in Dr Mott’s data ? 

In the first case mother and son are in the asylum at the same time and were 
admitted within a year of each other. It is very improbable that the relationship 
would escape notice and such a case is almost certain to be recorded. In the 
second case, however, the son is not admitted to an asylum till 40 years after his 
mother’s death. Even if the family remained in the same area for 40 years after 
the mother’s death, it would obviously be very difficult to connect the histories of 
mother and son. This case, which tells against the Law of Anticipation, is 
almost certain to escape notice. A spurious anticipation or antedating is thus 
inevitable owing to the method of collecting the data. 

It has also been pointed cut that Dr Mott has made no allowance for those 
who are mentally normal at the time the record is made but may subsequently 


* Congenital Idiot. 
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become insane, and this introduces further spurious anticipation. Another hypo- 
thetical example will perhaps make this clear. Let us take the case of a mother 
with six children, five of whom have become insane as follows: 














TABLE III. 
| Children 
} | 
ae) Sa ee a a ae. 
| oe 2 3 eet She 6 
SOUR PEP OSS. See Klee Thy eee 
} | 
Born ae . | 1830 | 1850 1852 1854 1856 | 1858 1860 
Became Insane “Sr oe 1872 1896 1914 1888 1860+ | 
| Age at Onset of Insanity | 30 —* 20 42 58 30 0 


The extent to which this family would show anticipation or antedating 
would depend very largely on the time at which the record was made as is shown 
in the following table. 





TABLE IV. 

Age of Onset of Insanity in | 
Date of PA eg et ae OP) re Amount of | 
Record Children Anticipation | 
| Mother Children 
—_—— | EE | SS — _ —__— 
1860 | 30 0 i) 30 | 
1872 30 | 0, 20 10 20 | 
1888 30 0, 20, 30 | 16°7 13°3 

1896 30 | O, 20, 30, 42 23 0 sj 
1914 | 30 | O, 20, 30, 42, 58 | 30 0 | 
} 








If the case were noted in 1860 then the age of onset of insanity in the mother 
is 80 years—of the child 0 years—a clear case of anticipation, and nothing 
would be known of the fact that four other children will afterwards become insane 
and will bring the average age of onset in the children up to 30 years—exactly 
the same as that of the mother. Nor is the record even now complete for if the 
eldest child ever becomes insane, the age of onset in his case must be at least 
64 years and this will further increase the average age of onset in the children. 
It is thus clear that in dealing with incomplete families and ignoring the possi- 
bility that those who are normal at the time of record may afterwards become 
insane, Dr Mott has introduced a further spurious anticipation or antedating. 


If we examine carefully the first pedigree given by Dr Mott at the Eugenics 
Congress}, we see clearly how probably much of the anticipation recorded by 


* Alive, 64 years of age and still normal. 
+ Congenital Idiot. 
+ Problems in Eugenics, p. 418. 
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Dr Mott has arisen. Unfortunately this is the only pedigree for which sufficient 


details have been given to enable its completeness to be tested. The pedigree 
and Dr Mott’s description of it are as-follows: 


“A.B., an alien Jew, aged 54, was admitted to an asylum for the first time suffering from 
involutional melancholia ; he has a sister who has not been in an asylum, but, as events turned 
out, bore the latent seeds of insanity. The man is married to a healthy woman who bore him a 
large family ; the first five are quite healthy, then comes a congenital imbecile epileptic (cong.)*, 
then two healthy children followed by a daughter who becomes insane at 23, then a son insane 
at 22, and lastly two children who are up to the present free from any taint. The sister of A.B. 
is married and has a family of ten, seven girls and three boys; one of the females was admitted 
to the asylum at the age of 19, and since this pedigree was constructed a brother of hers has 
been admitted aged 24. Half-black+ circles are insane. The pedigree is instructive ; it shows 
direct and collateral heredity ; it also shows remarkably well the signal tendency to the 
occurrence of insanity at an early age in the children of an insane and potentially insane 
parent.” 





54 


weeseryerr sry en 


€ oe 1¢ 
Cong. 23 22 19 








@ = Insane. 
13 children: 9 Alive, 4 Sons, 5 Daughters. 4 Dead. 3 Insane. 
Fig. 3. Pedigree to illustrate the effect of Dr Mott’s selection of cases. 


F. W. Mott: ‘‘ Heredity and Eugenics in relation to Insanity.” Problems in Eugenics, p. 413. 


This pedigree was given as above in July 1912, and in an address previously 
delivered before the Manchester Medical Society on Oct. 4, 1911, Dr Mott gave 
the same pedigree, but without any reference to the nephew of A.B. (brother of 


“c 


the girl who became insane at 19) who became insane “since the pedigree was 
constructed,” so that this man became insane between 1911 and 1912 and this 


serves to “date” the pedigree. 


Now it should be noted that at least five of the children of A.B. are over 23 years 
of age and up to the present time healthy. But all these children are alive and if 
any one of them afterwards becomes insane, the average age of onset of insanity in 
the children will be raised—and it is clear that the more incomplete the pedigree 


the greater the amount of spurious anticipation. Again Dr Mott states that in 





* This does not agree with Dr Mott’s pedigree which gives the congenital case as the seventh 
instead of the sixth child. ease 

+ According to our usual custom, they are represented by full black circles in Fig. 3. 
Biometrika x 
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nephews and nieces the age of onset is earlier than in uncles and aunts. In 1911 
this pedigree gives a case in which an uncle became insane at 54, his niece at 19— 
but one year later a nephew who became insane at 24 has to be added, thus 
raising the average and there are eight more children some at least of whom 
may become insane at later ages. As before the incompleteness of the pedigree 
introduces an artificial and spurious anticipation or antedating. The remedy is 
obvious ; we must only deal with completed families. 


A further fallacy involved in Dr Mott’s method of work must now be noted. In 
directly comparing the age of onset in parent and child, Dr Mott has ignored the 
fact that in the parent the incidence of insanity is for all practical purposes 
limited to the age of 20 and over since cases of congenital defect and of adolescent 
insanity hardly ever marry. Among the general population of asylums, however, 
12°/, become insane before the age of 20 and in Dr Mott’s selected data the 
percentage rises to 27—or more than a quarter of the whole become insane before 
20. This in itself causes a very marked spurious anticipation. As Professor 
Pearson has shown (p. 361 above) if we were to investigate the age at death in 
parent and child from accident or violence, we should find the same spurious 
anticipation. 


There are thus three fallacies involved in Dr Mott’s work. In the first place a 
spurious anticipation or antedating arises from the inclusion in the record of 
families whose history has not yet been completed, for those who become insane 
at late ages in the younger generation do not appear. Secondly, even with families 
whose history is completed, those cases in which the insanity of parent and child 
is contemporaneous are far more likely to be recorded than those in which the 
child becomes insane long after the parent*, and thus the cases which show 
anticipation are more likely to appear in the record than those which tell against 
Dr Mott’s views. Thirdly, by directly comparing parent and child, he has practi- 
cally limited one of the two groups which are being compared to ages at onset of 
over 20 years and has thus obtained further spurious anticipation. 


Dr Mott also lays stress on the appearance of insanity in a more intense form 
in the younger generation. “I have proved,” he says+, “that there is a signal 
tendency in the insane offspring of insane parents for the insanity to occur at an 
earlier age and in a more intense form in a large proportion of cases, for the form 
of insanity is usually either congenital imbecility,insanity of adolescence, or the more 
severe form of dementia praecox, the primary dementia of adolescence, which is 
generally an incurable disease.” But we have already seen that Dr Mott’s method 
of collecting his data is such that an enormous preponderance of early cases 
of insanity in the younger generation is inevitable and of course such cases are 
largely incurable. Type of disease is very closely related to the age of onset and 

* Dr Mott states (Archives of Neurology, Vol. v1. p. 82) that “the main bulk of the cards (i.e. his 


records), however, refer to parents and offspring admitted to the asylums within the last fifteen years.” 
t Archives of Neurology, Vol. v1. p. 82. 
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by selecting the latter we can alter the proportion of any particular type of 
insanity. Dr Mott has obtained his material in such a way that, in the younger 
generation, cases of insanity coming on late in life are much less likely to be 
recorded than those which appear in early life, and hence the early cases are in a 
majority, but the change in age of onset, and consequently of the type, is entirely 
spurious and arises solely from the way in which the material has been obtained. 


We can now deal with the reply Dr Mott has made to Professor Pearson’s 
criticisms. In his Annual Report for 1912 (p. 62), Dr Mott says: “ Professor 
Karl Pearson, writing to Nature, November 21, 1912, ‘On an apparent fallacy in 
the statistical treatment of “ Antedating” in the inheritance of pathological con- 
ditions,’ criticises on mathematical grounds the evidence of anticipation. I do not 
feel myself competent to reply to the opinion of such an eminent authority on 
mathematics applied to biometrics, but it does not militate against my conclusions, 
nor explain away the fact that a large proportion of the insane offspring of insane 
parents are affected with imbecility or adolescent insanity; for granting the 
assumption that there is no antedating at all, we might rightly expect the ages 
at onset of insane offspring of insane parents to be comparable with the ages at 
onset of all the admissions to the asylums during the same period*. This is by no 
means the case, for amongst the insane offspring there is a far greater proportion 
affected early in life, as is shown in the following figures and curves” (they appear 
here as Fig. 4 and Table V). 

According to these figures the onset of insanity among the recorded insane 
offspring of insane parents is considerably earlier than among the general admis- 
sions to asylums, but it has already been shown that this is due to the fact that 
the data have been selected in such a way that the early cases in the younger 
generation are the most likely to appear. Further, if Dr Mott’s argument be a 
valid one, we might also expect the ages at onset of the insane parents of these 
insane offspring to be comparable with the ages at onset of all the admissions to 
asylums during the same period. This is by no means the case as is shown in 
Fig. 5 below (see also Tables I and V). We see here that the insanity of the 
parents comes on at a much later period than among the general admissions to 
asylums and that there is a far less proportion affected early in life. If Dr Mott's 
method of argument be sound, he has not only to deal with an antedating of 
insanity among the offspring but also a post-dating of insanity among the parents. 
Both are of course spurious and arise from the peculiar selection of the data and 
from the fact that, owing to differential death-rates, the ages at onset of “admis- 
will never be the same as the ages at onset of the admitted—i.e. the asylum 


’ 


sions’ 
population—at any time. 


* «*We might rightly expect”’ these ages to be different, because ‘‘ admissions” are not the same as 
the population in the country who have at one time or another been insane. The percentages of total 
cases of acute mania, of senile insanity, of congenital idiocy, and of melancholia, who reach the asylums, 
are not the same. The reader has to distinguish between the population of admissions, the population 
of admitted, and the insane population of the country. A sample of the latter may be reached from 
completed family histories, but not from records on admission or from records of an asylum population. 
47—2 
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It is possible to illustrate the various fallacies which vitiate Dr Mott’s conclu- 
sions regarding anticipation by considering the age at death of parent and child. 
I do not know whether it is generally recognised that it is exceedingly difficult 
to get any considerable body of data in which the ages at death of a parent and all 
his children are given, for of course the record is incomplete and biassed until the 
death of the last surviving member, and in some cases to get a complete record we 
must trace the history of a family for over 150 years. George the IIIrd, for instance, 
was born in . 738 and all but one of his 15 children were still alive in 1810, 72 years 
afterwards, and the last surviving son, Duke of Cumberland and King of Hanover, 
did not die till 1851, 113 years after his father’s birth—and this is by no means 
an extreme case. In the material I am about to describe I found one case where 
the interval was 160 years. 





Another difficulty which arises is the tendency in practically all family 
histories to omit infant deaths, so that we do not get a complete record. It 
seems probable that the deaths of minors are not represented in such records 
in anything like their true proportion and that the differences are greater than 
might be expected to arise from differences of physique and nurture due to 
class. Thus records of the Landed Gentry give 31 deaths per 1000 males under 
20 years* while actual experience shows 163 to 197 per 1000+. But in the 


records of the reigning families of Europe we get a practically complete record of 


all members and therefore from von Behr’s Genealogie der in Europa regierenden 
Fiirstenhdusert, 1 have extracted particulars of the age at death of over 2000 
individuals—all belonging to the 18th century. There was here no selection— 
every child was entered and every family had been traced from the birth of the 
parents till the death of the last survivor. 


Now in Dr Mott’s data we have already seen that cases in which the age at 
onset of insanity in parent and child is contemporaneous are most likely to be 
recorded. We can test the effect of a selection of this kind by investigating the 
effect of selecting, from our data regarding the age at death among those royal 
families, only those individuals who died within a certain number of years of their 
father’s death, and the results are given below in Table VI, p. 370. 

When we deal with the whole of the data, absolutely unselected, every family 
being complete and traced to the death of the last surviving member, we find that 
680 out of 1829 or 37:2°/, died under 20 years of age. Let us now apply a very 
slight selection to the data and reject the 92 cases in which the interval between 
the deaths of father and child was at least 60 years. We find now that 680 out of 
the remaining 1737 died under 20 years of age—or 391 °/,. Thus the effect 
of a selection of this kind is to cause a slight increase in the proportion of deaths 
at the early ages. If we make the selection slightly more stringent, by taking only 
those who died within 40 years of their father’s death, the percentage of individuals 
dying under 20 years of age rises to 46°7 and if we go still further and consider 
* See Pearson: Proc. R. S. Vol. 65, p. 291. 
t+ Tauchnitz, Leipzig, 1870. 


+ Statistics of Families, p. 73. 
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TABLE VI. 


Illustrating the Effect of Selection of Material on the Distribution 
of Age at Death. 


(Reigning Houses in Europe—18th Century.) 








| | | Children who died : 
All Cases [—_ a i i ae aa a 
Unselected | within 60 years | within 40 years | within 20 years in their 
Age at Data of their of their of their father’s 
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| Under 20} 680 | 37°2 680 | 3971 | 680 | 46:7 | 680 | 624 | 648 | 82°7 | 
| 20—89 277 | 15°1 277 | 15°9| 277 | 19°0 | 254 | 23:3 | 121 | 15:4 
40—59 336 | 18-4 336 | 19°3 274 | 188 | 127 | 11°7 15 | 19 
60—79 | 450 | 246 395 | 22°7 214 | 14°7 29 | 27 — — | 
80andover| 86 | 4:7 49 | 2:8 10 | 71 — | _ _ me 
| ) 
y ht Dar pe mt a natn Wadi: TG 5 ae ree 
Totals | 1829 — 1737 | — | 1455 | — | 1090 — | 784 — 
| | | 
Average | | 
Age at 35°9 33°7 269 162 77 
Death* | 








only those who died in their father’s lifetime, then the percentage rises to 82°7 °/,. 
Looking at the matter in another way we find that the average age at death has 
fallen from 35°9 years to 7°7 years. 

The same facts are given in Fig. 6, which shows that as the selection of cases 
becomes more stringent, there is a regular increase in the proportion of deaths at 
the younger ages. In exactly the same way, the fact that cases where the insanity 
of parent and child is contemporaneous are the most likely to appear in Dr Mott’s 
records causes a spurious exaggeration of the cases of insanity at early ages in the 
younger generation and consequently a spurious exaggeration of the number of 
cases of imbecility and adolescent insanity. 

We can also investigate directly the question of anticipation or antedating 
on this material. In order to avoid the heavy weighting of large families which 
would arise if every child were entered, I have taken only one child from each 
family. Let us consider fist of all the distribution of age at death of Fathers 
and their First-born Children. The facts are given in Table VII. 

We have altogether 294 cases in which we know the age at death of a father 
and his first-born child. None of the fathers died before 20 but of the children 


* These averages were calculated, not from the five age groups given above, but from the same 
material classified in 15 age groups. 
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106 out of 294 or 36:1°/, died before 20. The average age at death among the 
fathers is 61 years, but among the children it is only 36 years, so that there is an 
anticipation of 25 years. To borrow Dr Mott’s words, the figures clearly show 
the signal tendency among the offspring to die at a much earlier age than 
their parents; that is to say, anticipation or antedating is the rule. 


Age at Death. 
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Age at Death. 
Fig. 6. Diagram to illustrate the Effect of Selection of Material upon the Distribution of 
; Age at Death. (Reigning Houses in Europe, 18th Century.) 
. Now in this material there is no selection of families. Every family was taken 
and the age at death of every first-born is known, so that we are only left with the 
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TABLE VIL. 
Showing Anticipation in Age at Death. A. Fathers and Children. 
(Reigning Houses in Europe—18th Century.) 














| _. | First-born First Sons who | 
| Age at Death Fathers Children Fathers | had children | 
| oe ae ae ae 
0~— 9 oa 95 an a 
10—19 os 11 — — 
20—29 ate oe 4 8 
80—39 | 16 18 8 15 
4O—49 | 45 | 31 31 39 
50—59 | 70 | 34 54 39 
60—69 | 1 i See 37 58 44 
v0—79 eae 46 | 54 
80—89 | 18 | 14 12 13 
90 and over — | at ws l 
Totals | 294 294 213 213 
| Percentage dying under 20 | 0 36°1 0 0 
| = 
oe as ieee SLSR LS oe ee Rd 
Average Age at Death ...| 61 | 36 60 59 
} } | 
| Anticipation bs es 25 l 





third of Dr Mott’s fallacies, in that no allowance has been made for the fact that 
the parental group is limited to ages over 20 while more than a third of the off- 
spring die under 20. The effect of this selection can be removed almost entirely 
by taking instead of the first-born child, the first son who married and had at least 
one child. There are in all 213 such cases and we see that there is now no 
anticipation. The difference between the average ages at death is less than a year 
and by removing the artificial selection we have got rid of all anticipation or 
antedating. 

These facts are also shown graphically in Figs. 7 and 8. The horizontal scale 
gives the age at death in 10-year groups while the vertical scale gives the actual 
numbers of parents and offspring dying in each age group. The diagram on the 
left shows marked anticipation, and should be compared with Dr Mott’s diagram 
(Fig. 1) in which the ages at onset of insanity of father and child are compared. 
When, however, we get rid of the selection of cases by taking only sons who have 
had children, then there is no anticipation. 

If we compare the distributions of age at death in mothers and children we get 
exactly the same results. The facts are shown in Table VIII. 


We see that the first-born children died on an average 18 years before their 
mothers, but when we compare the age at death of mothers and the first son in 
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TABLE VIII. 
Showing Anticipation in Age at Death. B. Mothers and Children. 


(Reigning Houses in Europe—18th Century.) 


First-born : First Sons to 
Age at Death Mothers Children Mothers hava Children 
| eto 
0 9 122 
10—19 2 13 l 
20)—29 17 26 21 8 
30—39 49 22 30 16 
4O—49 43 32 25 41 
50—59 52 35 39 40 
60—69 80 42 52 46 
10—79 52 36 41 54 
80—89 18 17 10 14 
90 and over 2 | I 
Totals 345 345 220 220 
| Percentage dying under 20 6 39°1 *b 8) 
e ae 
| Average Age at Death ... 53 35 55 59 
| Anticipation vn ize 18 —4 





each case to have children, then the sons live four years longer than their mothers. 
It would have been better in this case to have compared the mothers with the 
first daughters to have children but unfortunately von Behr gives very little 
information regarding the female lives, except in special cases. The figures show 
a marked anticipation in age at death when we directly compare, as Dr Mott 
has done, mother and child, but this vanishes when we remove the arbitrary 
selection. The same facts are shown graphically in Figs. 9 and 10. 


If we combine these figures we can compare the age at death of parent and 
child and the results are shown graphically in Figs. 11 and 12. 

Fig. 11 shows that Dr Mott’s limitation of one of the two generations he is 
comparing to adults, without imposing a similar limitation on the other generation, 
introduces an artificial and spurious anticipation. The average age at death of 
the parents is 56 years and of their first-born children only 35 years—so that we 
get an anticipation of 21 years. If, however, we make the two generations 
almost directly comparable by dealing only with sons who have children—there is 
no significant difference between the two averages (58 against 59 years). 


In these cases we have dealt only with completed families and have taken 
every family without selection. If, however, we consider only the cases in which 
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the eldest child died in his father’s lifetime the amount of anticipation is greatly 
increased. The facts are shown in Table IX and in Fig. 13. 
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children died under 30; the average age at death among the fathers was 62— 


among the children only 10, showing an anticipation of 52 years. 
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TABLE IX. 
Showing Anticipation in Age at Death. C. Fathers and First-born 
Children who died in their Fathers’ Lifetime. 
(Reigning Houses of Europe: 18th Century.) 
First-born Children 


Age at Death Fathers dying in their 
Fathers’ lifetime 


0 9 89 
10—19 1] 
20—29 16 
30—39 6 10 
4O— 49 20 5 
50—59 32 2 
60—69 33 
70—79 32 
80—89 10 auld 
Totals 33 33 
Total 133 133 
Percentage dying under 20 0 75°2 
Average Age at Death ... 62 10 
Anticipation a wee 52 


It is now possible to illustrate the effect of the principal fallacies which vitiate 
Dr Mott’s conclusions. In the first place he has dealt with families which are 
largely incomplete and has collected his material in such a way that cases in 
which the insanity of parent and child is contemporaneous are the most likely to 
be recorded ; in the second place he has directly compared parent and child with- 
out allowing for the fact that practically no parent can become insane before 20, 
while there is no limitation of this kind among the offspring of these insane 
parents. 

In Table IX and Fig. 13 we see the effect of dealing with incomplete families 
in which the children died in their fathers’ lifetime. There we get an anticipa- 
tion of 52 years. If we get rid of the first and second fallacies involving a 
selection of cases by dealing with every family, as shown in Table VII and Fig. 7, 
the anticipation falls to 26 years. If we get rid of the third source of fallacy 
also, by comparing the fathers with the first sons who have children, asin Table VII 
and Fig. 8, then the anticipation falls to less than a year. The Law of Antici- 
pation or Antedating has thus in Dr Mott’s case no foundation, in fact it is a 
spurious result of the mode of collecting and interpreting data. 

Now Dr Mott has not only asserted that this “ Law” applies to insanity but 
has also drawn the conclusion that the offspring of insane parents if still normal 
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at the age of 25 may safely marry. In an address delivered before the First 
International Eugenics Congress*, he said: “ You will observe that 47°8°/, of the 
500 offspring had their first attack (of insanity) at or before the age of 25 years 
and as you see in the curves of parents and offspring, the liability of the child of 
an insane parent becoming insane tends rapidly to fall. Now besides the fact 
that this shows Nature’s method of eliminating unsound elements of a stock, 
it has another important bearing, for it shows that after twenty-five there is a 
greatly decreasing liability of the offspring of insane parents to become insane 
and therefore in the question of advising marriage of the offspring of an insane 
parent this is of great importance. Sir George Savage recently said that this 
statistical proof [sic !] of mine entirely accorded with his own experiences, and that 
if an individual who had such an hereditary history had passed twenty-five and 
never previously shown any signs (of insanity) he would probably be free and he 
would offer no objection to marriage.” 

Now I entirely fail to understand how anyone could recommend marriage in 
such cases, even on Dr Mott’s own figures; for if it be true that 48 °/, become 
insane before 25, it must be equally true that 52°/, become insane after that age 
and this very important point seems to have been forgotten. These figures, 
however, are taken from Dr Mott’s selected data, selected in such a way that the 
early cases are enormously exaggerated. Until Dr Mott publishes a series of 
complete pedigrees, it will be safer to assume that the age at onset of insanity 
among the offspring of insane parents does not differ widely from that of all 
admissions to Asylums and there we find that only 21°/, become insane before 25, 
and 79°/, after 25. 


But surely at a Eugenics Congress of all places some thought might have been 
given to the mental condition of the children resulting from such matings, before 
advising marriage. It would not have been difficult for Dr Mott to have extracted 
all the available cases of this kind from his collection of pedigrees, i.e. all cases in 
which an individual had an insane parent and was normal at the age of 25, and so 
have discovered the probable fate of the offspring from such matings. 


Unfortunately the details given by Dr Mott regarding his pedigrees are usually 
so scanty that little use of them can be made, but two at least show the danger of 
the matings Sir George Savage and he sanction; these two pedigrees were given 
by Dr Mott in his lecture on Heredity in Relation to Insanity, delivered to the 
members of the London County Council. The first is shown in Fig. 14. (It 
appeared as Fig. 11, p. 18 of Dr Mott’s lecture.) In the first generation a man 
who became insane at 70 had four children. The eldest, a girl, became insane at 
68 and was therefore normal long after the age of 25. Dr Mott does not state 
whether the marriage of this woman preceded or followed the onset of insanity in 
her father, but even if her father had become insane before her marriage, Dr Mott 

* Problems in Eugenics, p. 425. This is one of many illustrations of the evil done by that Congress ; 
attention was directed and much weight given to hasty statements and ill-digested material. 
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would have raised no objection to the marriage since the woman herself was not 
insane. There were in all six children from this marriage of which Dr Mott would 
have approved. Two became insane, three were blind and five are said to have 
been paupers. 
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The eldest child remained normal till the age of 34 and although both his 
parents became insane Dr Mott apparently would not have objected to his marrying. 
He did so and one of his children became insane and eight out of nine are said to 
be paupers. These nine children are apparently still young so that their ultimate 
fate is still uncertain. 


The second pedigree I shall quote was given as Fig. 28, p. 33 of Dr Mott’s 
lecture, and appears here as Fig. 15. 


A man who had an insane father and an insane grandfather became insane at 
the age of 55. He was therefore normal at the age of 25* and Dr Mott would 
have sanctioned marriage in his case. He actually married twice. His first wife 
was tuberculous but not insane; they had two children, both insane. His second 
wife was normal and it is definitely stated that there was no insanity in her 
family; they had five children and one of these became insane. Yet Dr Mott 
would permit the children of insane parents to marry if only they are normal 
at the age of 25! 


Again, Dr Mott has stated that it is useless to attempt to limit the fertility of 
the insane since most of their children are born before the onset of insanity, 
and therefore before any action can be taken. From his statistics of relatives in 
L.C.C. Asylums, Dr Mott has calculated the proportion of offspring who were born 
after the first attack of insanity in the parent and found that “Forty-six offspring 
out of 581 were born after the first attack of insanity in the parent, ie., 7°9°/,. 
That is to say in the case of 529 insane parents, the birth of only one-twelfth of 
their 581 insane children would have been prevented by sterilisation or life segregation 
of the parent after the first attack of insanity. These figures refer to the offspring 
which become insane, but there are a large number of offspring which do not become 
insane and these would be cut off if life segregation or sterilisation were adopted +.” 


But here again Dr Moit is using the data obtained from his index of relatives 
which shows a greatly exaggerated number of cases at the earlier ages among the 
offspring, and he thus greatly exaggerates the number of cases in which the children 
were born before the onset of insanity. No conclusion can be drawn from any but 
complete records of families, But apart altogether from this, many of these parents 
are themselves the children of the insane and much could be done to discourage 
such marriages. Unfortunately as we have seen Dr Mott directly sanctions 
marriage to those who remain normal till the age of 25. 


In further support of his view Dr Mott has stated that out of 642 females 
admitted to three London County Asylums in 1911, 148 were recurrent cases and 
of these 32 (21°/,) had children between their respective dates of admission. 
“The inference that can be drawn,” he says, “is that about one-fifth of the 
recurrent cases, or approximately one-twentieth of the female admissions have 


* If the term “‘age at onset” has any real meaning. 
+ The italics are Dr Mott's. 
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children after their first attack of insanity and of 31 such cases examined, 73 chil- 
dren were born after the first attack of insanity in the parent.” 

But have these 148 recurrent cases been followed up to the end of the repro- 
ductive period? Not at all. No ages are given and the cases are merely those 
which were admitted to Asylums in 1911, Dr Mott’s remarks being made in 
June 1912, so that no attempt has been made to follow them up. There is no 
justification for Dr Mott’s advice. 


There are many other points in Dr Mott’s work which deserve detailed exami- 
nation, but time will not permit more than a brief account of a few of them. 


It should be noted, for instance, that Dr Mott has used his index of relatives 
in London County Asylums as an argument in favour of the importance of the 
inheritance factor in insanity. His argument is as follows: 


“At the present time in the London County Asylums there are 725 individuals 
so closely related as parents and offspring, brothers and sisters. A priori, this, to 
my mind, is striking proof of the importance of heredity in relation to insanity, 
for we cannot suppose that 20,000 of the 4} millions of people in London brought 
together from some random cause would show such a large number closely related 
as 3°6°/,.” 

But Dr Mott has not attempted to give, and I doubt if he ever will be able to 
give, a satisfactory estimate of the number of relatives in even a random sample 
of the population, and the population of asylums is far from being a random 
sample of the general population—there is for instance an extraordinary divergence 
in age. Yet without definite information on this point it would be impossible to 
say whether insanity is inherited or not—that is if we had to depend solely on 
Dr Mott’s data, 

It should also be noted that in these cases Dr Mott has clubbed together 
every form of insanity, from congenital idiocy to senile dementia, except of course 
cases due to specific infections or trauma. I myself think that course is the only 
possible one. To anyone who has studied even a few pedigrees of mental defect, 
nothing is more striking than the extraordinary number of different forms of 
mental defect that may appear in the same family. 

Seven years ago, in a First Study of the Statistics of Insanity and of the 
Inheritance of the Insane Diathesis*, I was confronted with the same problem, 
and after a full consideration of all the available data and of the opinions of those 
w dical men who were best qualified to express an opinion came to the conclusion 
t: t the only possible course was to group all forms of insanity together, with, 
of course, the exceptions I have already indicated. The whole question was dis- 
cussed very fully in my paper and it was there suggested that an even broader 
classification might be of service. This point of view met with some criticism at 
the ‘ime but nothing has occurred to alter it, and the study of the inheritance of 


* Galton Memoirs, No. Il. (Dulau and Co.) 
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insanity in general or of an even broader degeneracy must always remain the 
first object of our studies. 


Any investigation of the inheritance of special types of insanity or degeneracy 
can only be carried out however on unselected material—on the records of com- 
plete families. The type of insanity is so closely related to the age of onset that 
any tendency to exaggerate the number of early cases, as in Dr Mott’s material, 
will entirely vitiate the conclusions drawn. Thus Dr Schuster’s conclusions as to Hi 
the inheritance of special types of insanity based upon Dr Mott’s data* must also 
be rejected on the above grounds. 





Dr Mott’s index of relatives in London County Asylums is unfortunately of 
very little value in the study of inheritance in insanity. Progress can only come 
from the study of complete pedigrees in which every member of the family is 
entered, whether insane or normal, and the ages of the normal at the time the iH 
record was made are just as important as the age at onset of insanity in the insane 
members, for a statement that a young man of 20 has not been insane is of a | 
very different degree of importance from the statement that a man of 70 has 
not been insane. 





In the papers I have cited the children of the insane if normal at 25 are 
advised to marry, and it is asserted that it is useless to attempt to discourage the 
reproduction of the insane since most of their children are born before the onset of 
insanity, and that we should rely on the Law of Anticipation to end or mend 
degenerate stocks. 





I have shown, I think, that the Law of Anticipation as applied to the insane 
has no foundation in the facts provided and that the advice given as to the marriage 
of the insane and of their normal offspring is fundamentally unsound and directly 
cacogenic. Much yet remains to be learnt regarding the inheritance of the insane 
diathesis, but no one who has studied the family histories of the insane can doubt 
that in inheritance we have by far the most important element in the production 
of insanity, and in view of all the facts it is the obvious duty of the Eugenist to 
discourage, rather than to encourage, procreation by the insane and even by those 
of their offspring who appear to be normal. 

* Report on the Statistical Investigation of Relative Cards, 21st Annual Report of the London 
County Council Asylums Committee (1910), p. 95. 














ON THE PROBABLE ERROR OF THE BI-SERIAL 
EXPRESSION FOR THE CORRELATION COEFFICIENT. 


By H. E. SOPER, M.A. Biometric Laboratory, University of London. 


In a recent paper* Professor Pearson shows that where one character is in 
multiple graded grouping and the other in alternative categories, greater or less 
than a given magnitude, the correlation coefficient admits of simple expression ; 
the assuinptions being that the unmeasured character, B, has a normal distribution 
and that the measured character, A, has linear regression upon B. Under these 
conditions the data required are the numerical ratio of the alternative groups, 
the standard deviation of the measured character and the deviation from the 
general mean of this character of the mean of one of the groups. 

This expression is subject to greater fluctuations of value in samples of WV 
of the population than is the product moment form, especially where one of the 
groups is relatively small; and it is proposed to find formulae for the mean and 
second moment of the errors from this mean to a first approximation, that is to 
terms in 1/N. These will appear in terms of the correlation coefficient, r, of the 
original population (which will be supposec normally correlated) and the fractional 
frequency, f, in that population of the group possessing the greater or positive 
intensity of the character put into two classes, 

Let y be the graded character and @# the alternative character the intenser 
value of which is possessed by the fraction f of the population. Let %, 7 be the 
general means and cz, o, the standard deviations of # and y. Then it is shown 
in the paper that if 2’, 7 are the means of the group /f, 

ho ox 


r in es clfas ines ap ibaes Seas ace (1), 


Cy au—a 


on the assumption that the regression of y upon « is linear; and that if «# be 
normally distributed this is equivalent to 


* Biometrika, Vol. vu. (1909), p. 96: ‘*On a New Method of Determining Correlation between 
a Measured Character A, and a Character B, of which only the Percentage of Cases wherein B 
exceeds (or falls short of) a given intensity is recorded for each grade of A.” 
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where z is the ordinate of the normal curve cutting off the area 
f=h(.—a) or $(1+@) 
as defined in Sheppard’s Tables of the Probability Integral. 
Now if Po. Pr» Pa, ete. 


be the moment coefficients of the whole population with respect to the character 


y defined by 
at NTN. vidsadscaveorescexeduseneeceaened (3), 


[see bi-serial table below which is here to be looked upon as representing the 
general population] and 
Por Pr, Pz, ete. 


are moment coefficients of the group n’ (=f) defined by 
De Rg BOD. acacscesccscenententeussseenges (4), 
we may write f=p), fy’ =p, Y= Pr, Ty = V(po— Pr) and 


ci Nine Po Pr x 1 











< mtr. (ee Galva Ghumeadereedte pet ewe (5). 
V(p2—pr) 2 
— > Grades of y in Bi-serial Table. 

; n," Ne nN” Re n” 
4 > ’ oi 
g | m' | me ns Ne nw ] 
. 
§ apie BP. 

Ny Ny Ns ee ee. “Ui ieee les aR Sherecins (6). 

n'/N=f 


In samples of N the frequencies n,, n;, n;’ and consequently the moment 
coefficients p and p’ and the ordinate z are subject to fluctuation and the values 
of the correlation coefficient calculated from this formula will have a distribution 
of errors. Let 7 be the mean value in such samples, dr the deviation from this 
mean value in any sample when dp, 5p,, Spy, ete. 8z are the deviations in moments 
and ordinate. Then 


= Sp Pt + 8p — (po + 8p) (pi + Spr) 


7+6r= ee 7). 
V {po Spe — (p, + 8p,)"} x (2 + 82) “) 
To express §z in terms of deviations of the moments we have 
7 ~ ha? 
a= 7 BY:  ekest taverns boresenc aon mes} 8 
V Qer 


where a is the abscissa of the point of section of the normal curve, defined by 
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Hence to second order terms in powers and products of deviations 


+82 1 g- 3 (a+sa)? 


=z{l—ada—4(1—a?) (8a)} 





at+éa 1 a 
— df= — ¢~ 3" dy 
7 Ya \Qar 
ba 1 


= - _— e~ b(a+é? de 


=: [*-ag-401 — a?) 2+...) dE 


=z {da — 4a (da)*}. 
It follows that 


=z+adp, — hs BT . Weresas Rkninsereeted (10). 


At the same time that this value is put in (7) we may simplify the expression 
and subsequent algebra by supposing the graded character y to be measured 
from its universal mean value as origin and in terms of its standard deviation as 
unit of measurement in which case 

r=0, poe=o,=1, and by (5) py HZr  .reeeecereeeees (11), 
and since p, =f (7) becomes, 


zr + dp,’ — f Sp, — Spy Sp, 





7+o6r= ; me ceeeeeaeens (12) 
/{1+ dp. — (Sp,)*} x \? + adp,'— 5; (Op t 
Expanding to second order of deviations we find, 
We Oi Fey Oe encod ccnaveessnsdusvescocessel (13), 


where 6,, 5, are the first order and second order expressions 


1 


8, = — {8p,’ — fSp, — $2r Sp, — ar Spy}, 


5,= : jdardp.3p,' - - 5p,’ dp,’ 41 5p, dp. — $6, Sp, + $f8p, Sp, 


3 P 
g 2" (Ops) + (1 + 2c?) = (Spo rt sseees (14). 

Sate sb OIE EL: dc cxsecseavuacssseseyscasccsviels (15), 
mean 6, being zero since by (3), (4) 


— dp, dp, + zr (Sp,)? + 


Taking mean values 


mean dp, =S (mean dn,y,”)/N =0, 
mean dp,’ = S (mean &n,’ y,”)/N =0. 
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Mean 64, is to be evaluated by the formulae 
mean dp,5p, = (Pus+v — PuPo)/N 
mean bp,’ dpy =(p'usv — Pu Po )/N E hevkwetutbbecadedee (16), 
mean dp, dpy =(p'us» — Pupoe)/N 
of which the first two are well known* and the third may be proved thus: 
N Spu =S (Sngys") = S (Sns' ys") + S (Sn, ys"), 
N Sp,’ =S (dn; y;”), 
*. N*Spudpo' = 8 {(Sng)? ys"*"} + S [Sng Sng’ ys" ye} + S [8ng' Sng’ ys’ yo" 


where in the third sum s’ may or may not equal s. 


’ 


But mean (8n,')? =n, (1 —n,/N), 
mean (8n, dng’) =— n,' ny’ /N, 
mean (dn, dny’) = — n,'n,"/N, 


the last whether s=s’ or not. Thus we findt 
mean dpudpo =(p'ur»— Pu Pv — Pu Po) | N =(p'usv— Pupo)/N. 


Evaluating mean 6, by these formulae we find, 
a 1 , , a , ae 
r= rt 7. sar (pe — P2Po p= =i i — Pi Po) 


fa , , / , ¥ 
+> (Pi — PiPo) — 3 (Ps — Px P2) + 3 (Ps — Pips) 
; j ee. Ay = e 
— (Pr — Po Pr) + $27 (ps — pr’) + 3 2r (pa— ps*) + (1 + 2%) = (po - pi) --(17), 
in which the undashed moments, being those of a normal curve with unit standard 
deviation, about its mean, have the values 
~ri=0, pe=1, ps=0, p=3, 
and the dashed moments beyond the first two, 
Po=f, pi =2r, 
have values depending upon the nature of the frequency distribution of y and a. 
Assuming #, y normally distributed t 
Fin rp? po ‘ ] — ce { x? — 2ray + y?} 
PE=} | P aeay? BO EO dedy, 

f= ff yp a I a0 { x? — 2ray + y*} dad 
Ps : ae " Qar /(1 — r*) y- 


* See Biometrika, Vol. um. (1903), p. 275: ‘‘On the Probable Errors of Frequency Constants.” 
K. Pearson. The second follows in exactly the same manner as the first, since the constancy of the 
total frequency dealt with is only involved, in deducing the relations (i) and (ii) of p. 274, of that 
paper, 

i yd =8 (n,”y,.")/N. 

+ Since the moments appear in the term containing 1/N any errors in their calculation due to 
incorrect assumption of normality will not affect the present approximate formulae provided such errors 
are of the order 1/N. 
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Putting y=ra +7 and integrating with respect to » for constant «, 


re os 2 
pi=[” {rep +(1—19)} 07 #*de 
Ja } V 20 


OG YS on iiateshcdicsspnceeesastbi cba ctecssa ce sus tr veeteeceie (18), 
Pp; = e {(ra) + 3 (rx) (1 - r*)} Ee e~ "dx 
7645 V 2a 
eRe OOM EIOES.. “oh can Caer anaqos ese ceaseaeeeumatetesre ee (19). 


When these values are put in (17), and terms collected, the mean value of the 
bi-serial correlation coefficient in samples of NV is found to be 


- | I yf, Se ee at 
. ye 4 Aa fe ba) Yen Lye | | ) 
Far {i+ 5 [2414 \ “\(14 = )+4r]] Satta coun (20), 


£2 Zz] 


where f =1—f=n"/N. 


In the work of obtaining this approximation all powers and products of 
deviations above the second order have been neglected. The means of such 
terms in samples of NV involve second and higher powers of 1/N* and the present 
result is correct to the first approximation. 


Again squaring (13) and taking mean values and subtracting the square of 
(15) we find to the same approximation as before 


o,? = mean (or) = mean 6, .........scccccencecseess (21). 


The evaluation of mean 6, being carried out precisely in the same way as 
mean 6,, the result is the second moment of deviations of the bi-serial corre- 
lation coefficient in samples of JN, 


the values of ¢,, y.’+ and y, for values of 4(1—a) [= the smaller of n’/N, n”/N] 
from °50 to ‘01 are to be found in table (24), 


* See Biometrika, Vol. 1x. (1913), pp. 97—99. 
+ xa for 4(1+a) was tabled in Biometrika, Vol. 1x. p. 27, and the table is reproduced in Tables for 
Statisticians and Biometricians, p. 35, Cambridge University Press. 
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| $(1 —a) a | Xa” Va 3 (1—a) | ha | Xa" Yo 
|— pf |) 4 
50 0354 1°5708 | 2°5000 20 | — 871 | 2°0414 | 2°8578 
49 0353 1°5711 | 2°5003 ‘19 | — °1001 | 2°0898 | 2°8951 
48 0350 1°5722 | 2°5011 ‘18 | — °1146 | 2°1437 2°9364 
“47 0346 15741 | 2°5024 ‘17 =| — °1808 | 2°2035 2°9825 
46 0339 1°5766 2°5043 16 | — °1490 2°2703 3°0341 
45 0331 1°579% 2°5068 15 — ‘1696 2°3453 3°0923 
“4A 0321 1°583% | 2°5098 “14 | — 1931 | 2°4303 3°1582 
“43 0209 15886 | 2°5134 13 — °2201 | 2°5272 3°2337 
42 0295 1°5943 2°5177 12 | — *2513 | 2°6389 3°3208 
“41 0279 1°6007 2°5225 ‘11. +| — -2880 | 2°7687 3°4224 
“40 0260 1°6079 2°5279 “10 — ‘3317 | 2°9221 3°5427 
| °39 0240 1°6161 2°5341 095 — +3568 3°0095 36115 | 
| 38 ‘0217 1°6251 2°5409 090 | —. *3844 3°1057 3°6873 | 
| *37 0191 1°6351 2°5484 O85 — 4153 | 3°2119 3°7713 | 
36 0163 1°6461 2°5568 080 — +4499 3°3299 3°8648 | 
85 “0132 1°6582 2°5659 O75 — °4887 3°4620 3°9696 | 
Bh “0098 1°6714 2°5759 070 — 5324 3°6110 40879 | 
33 0062 1°6858 2°5868 065 — *§822 3°7806 4°2294 | 
| 32 ‘0021 17015 | 25986 060 | — 6403 | 39748 | 4:3777 | 
|} ‘sl — 0023 1°7186 2°6115 055 — *7083 42002 4°5584 | 
30 — 0070 1°7371 26256 050 — °7897 4°4652 4°7723 
‘29 — 0122 1°7573 2-6409 O45 — ‘8868 | 4:7829 50283 
“28 ~-0179 17791 2°6575 040 —1:0053 | 5°1715 
27 — 0241 1°8028 2°6755 O35 —1°1577 5°6568 
"26 — *0308 1°8286 2°6952 ‘O30 — 1°3556 6°2859 
25 — 0382 1°8567 27166 025 — 1°6308 7°1347 
“24 — 0462 1°8874 2°7399 020 — 2°0272 8°3600 
23 — *0550 1°9208 2°7654 “O15 — 2°5263 10°3024 
“22 — 0647 1°9574 2°7933 010 — 3°8889 13°9393 | 
| “22 — 0753 1:9974 2°8240 
| | 





The bi-serial value of the correlation coefficient has the standard deviation 
9 9 / T 
Or = (Xo? — War? + 1*)/VN, 
whilst that of the product moment value is 
o,=(1—7")//N. 
In table (25) a comparison of the values of the numerator is made for five 
values of r, for divisions at 0, °5,...2°5 times the standard deviation from the mean 


of the ungraded character. 


Values of ,/(xa2- War? +r4) for 4 (1-a)= 
Values 


(l—r?) | +500 309 "159 067 023 006 


00 1°00 1°25 Ge te so 1°93 7G |. £9 
25 ‘9375 Li 1°25 1°45 1°86 2°68 4°3 
0 *750 1°00 1°06 1°26 1°65 2°42 4°0 
75 "4375 69 “75 | “94 1°30 1°95 3°2 
| 1°00 00 O7 | 33 "49 “74 113 1°8 





Biometrika x 
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Thus the effect of grouping and applying the bi-serial value of the correlation 
coefficient is to add 25°/, to the probable error in the most accordant case where 
r is zero and the division equal, whilst if r is as large as ‘5 and one group as small 
as 10°/, of the whole the probable error is nearly doubled. For higher values of 
r the errors of sampling, in the case of the product moment formula, grow smaller 
and ultimately vanish when r=1; but the bi-serial values are not invariable in 
samples drawn from a perfectly correlated population but possess a variability as 
high as ‘27//N in the most favourable case when the grouping is equal. 

If the standard deviation be calculated from the approximate formula, 

WeAE LT MP EILE cxvicriceeccccndtcdasiczensd (26), 
which may be written * 
ites PUREE soho iieac cickeneceaval (27), 


the error of computation will not be great for values of f and r commonly met 
with as the following table compared with the last will show: 


Values of xa—r? for $(1-a)= 


x 
=) 
S 


309 | -159 | 067 | 038 | 006 
| i DE. & ahs | sD | 
| 00 1°25 131 | 161 | 193 2°76 45 | 

25 119 | «61:5 «| (245 1:86 | 2°70 44 | 
| “50 1:00 1:06 | 1:26 168 | 2°51 42 | 
75 ee 1:36 | 2°20 oe 
| 1-00 ar | 5 35 


‘51 | ‘93 | 76 | 
| gone (28). 





The difference between the two expressions only reaches 5‘/, when the 
smaller group is less than 7 °/, of the whole. 
5 / 


° 


It will not be necessary, excepting in small samples, to apply a correction 
to the bi-serial formula for r in virtue of the mean of samples differing from the 
population value. The correction is less than 1/Nth part of the value calculated 


° 


unless one of the alternative classes is as small as 4°/, of the whole. 


I have to thank fellow members of the Staff for assistance in calculating the 


tables. 


* For a table of xa see Tabl-s for Statisticians and Biometricians, p. 37. 














ON THE PARTIAL CORRELATION RATIO. 
PART I. THEORETICAL. 
By L. ISSERLIS, B.A. 


§ 1. The theory of non-linear regression in the case of two correlated variables 
is due to Prof. Karl Pearson*. He shows that regression ceases to be linear 
when the correlation ratio differs sensibly from the correlation coefficient 7 and 
establishes criteria for parabolic, cubic and higher forms of regression. 


The present paper deals with the regression surface of three correlated variables 
x, y, 2, Where, though the regression of z on 2, y cannot be adequately represented 
by an equation of type 


oz ox Cy 

the regression of z on « for a constant y and of z on y for a constant « is linear. 
A large proportion of the non-linear cases that occur in practice fall into this 
class. It will be remembered that Z,, in (1) denoting the mean of the array of z’s 
for a given # and y the coefficients y;, 3’ are partial regression coefficients and it 
will appear that just as it is necessary to introduce the correlation ratio » for an 
adequate description of non-linear regression of two variables, there must be 
introduced multiple or partial y’s for the description of such regression in the 
case of more than two variables. 


We recall the definition and principal properties of .»,—the correlation ratio 
of y on #, oq, being the square root mean weighted square standard deviations of 


the arrays of y: 


S(nzo", SS {nay (y — Jz) } 
(l-7)o,7= ea — paste | (2), 
wre he Sine ? . ma en UnCut se Te ehode (3), 
oy iNOy 
and N (9? — 17) oy2=S {re (Yng— V)}  ceeseeveseeeesereeees (4). 


* Drapers’ Company Research Memoirs. Mathematical Contributions to the Theory of Evolution. 
XIV. ‘‘On the General Theory of Skew Correlation and Non-linear Regression.” 1905. 


50—2 
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Here we are dealing with NV pairs of two characters A and B. ny of these have 
the character w of A. J, is the mean of this z-array of B’s. op, is the standard 
deviation of this array, on, is the weighted standard deviation of the means of the 


arrays and Y is the value of y given by the regression straight line, i.e. 


This is the “best fitting” straight line (in the Gaussian sense) to the means 
of the arrays, and is the regression line when the regression is linear. 
2. Consider now three correlated characters A, B,C. If N combinations 
’ 
of A, B, C are taken, we may denote by n, the number of these which have the 
character A=a and by n,, the number in which A has the value # and B has 
the value y. Let %, 7, Z be the mean values of the total population, and let Z,, 
be the mean of z for a given # and y. The frequency of A=, B=y, C=z 
1S Ngyz- 
We define the correlation ratio of z on # and y, which may be denoted by 
ayfz, or if no confusion is likely to arise by H,, by the equation 
YOY x 2) 
a SSS {n 2(2 ~- % yt ~ 
o?7(1—H?]= ; “i a th ecctinevecsduveneeve (6). 
The triple sum in the definition can be written 
YO = = = 2 
SSS {ayz(2 — 2+ Z — Zzy)"} 
CY = = 9 ¢ ‘< = = Y = C " =\2 
= SS {Nyy (Z — Zzy)*} + 2S {(Z — Zry)}  S {Nayz (2 — Z)} + SSS {nayz (2 — 2} 
=SS { ary (z— Zany} —2SS (Nay (Z- Zey)*} + No?. 
SS {Nay (Z — Zry)?! 
ee 3 ay \? Zoey)” a 
Hence wll = FoF teesaetearenenen seshoneenae (7) 


This is a generalisation of the property of ,», given by (3). 


rl ‘ . . . 
Further, the “best fitting” plane to the means 2,, is given by 


z—-Z L—2X ,yY-yYy 
me pe. SO, a: ON OE (8), 
oz ox Cy 
Vee — Nyt 
where Y= I ches a Nee eh (9) 
1— "xy 
, Vyz Vez" ey / 
Ys iene pee EEE (10). 
Yay 


Let ,,R, denote as usual the maximum correlation of z with any linear function 
of # and y, then 


pe OM ES Pie «tp nic stag uns baal sustavewcannebines (11), 


= yz + 12x — ee Vazl ay ( 12) 


1 — ry, 
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Subtract (11) from (7) after replacing r,, and rz, by the appropriate sums, and 
we obtain 


Ley H?- xy RZ] No? 
YOY ~— Oz : , G; ; 
—_ | (2 — 2a) — 8 | Pd —2)(x@—- 2) v5 i jew ." —Z2)(Y-Y) 9s | 


YO = = 9 Oz,- a = tex = - a is 
<n E 3 = Za) — Ney — (Zzy — 2)(@—2) Ys — Nay — (Zay a z)(y— Y) Ys | ; 
Tx oy 


Using (8) this can be written 
[xyH? — xy Rz"|] No? =SS {nzy (Z2y — 2) Zay — Z)} 
= SS {ngy (Zry — Z)*} + SS {nay (Z — 2) (Zxy — Z)} ...(13). 
But SS {ny (Z—Z) (Zz, — Z)} 








= Nox (3020 2x — 3° Oz — Y3'Vs Vay Fz + Ys Fz yz — Ys'Vs VeyFz — Ys? Oz) 
= No? {ys (rz — Ys — Ys Tay) + Ys (yz — Ys — Ys Try)} 
Using the values of y; and y,’ given by (9) and (10) we see immediately that 


la te Ys Vay = Vyz — "Ys — V3" ay = 0. 
Hence (13) becomes 
(nyH? — ayR2) No? = SS {gy (Zey— ZY} v.seeeecescceeeees (15). 


This is the generalisation of (4). We deduce that ,,H,=,,R, if and only if the 
regression is strictly linear, that otherwise ,,H,>,,R? and by (6) that ,,H?< 1. 


§ 3. These properties and definitions can be extended to the case of m 
variables a, %%, ... ®%m. We now use » »& for the mean of 2, when a, 2, ... mp 
are given and denote by S a summation extending to the variables 2,, 2, ... mn. 


1...m 
If we define the correlation ratio of a, on a, 23, ... 2m by the equation 
Ty 2 y a. \2 
Nef — as. a = 8 (G,— a5. til tren .ecssecetiseces (16). 
l...m 


We can deduce in the same way as in § 2, the relation 
2 2 (mm = \2 
Nos, wlth] B [Gy — a9. hl Gesal osc0seessvenwssesests (17). 
2...m 


In order to generalise (15) we recall that the “best fitting ” linear function of 
the variables a, 73, ... Zm to the mean o3 ma 1S 


Y 4 x XL. — 2 By — Ws, x. x 
be yg by 0. yg (18), 


a; a) 2 oO; Tm 
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Ry ‘ . es : . 
where by, = R. and Ry, is the minor with its proper sign of the element in the pth 
ll 


row and gth column of the determinant 


=| l Tro eee Tim | 


Tm mae 1 | 


while the maximum correlation between any linear function of a, a, ... %, and 
2,18 55 mit, where 


R- Rk, 


—o3 mit = — Paice Dasha oe Ols Man oes od icasncceesecvs (20), 
ll 
( 7 n 7 yf y ~ , > \} 
5 {M2 (&, — %,) (#2 — Xe)} a {Mis (4, — X,) (as — %5)} 
= Dis ——_—_— “a-~-eeeae - + bis - Pa 
No,o, Noo; 
(op 7 . > \) 
S {Mam (a — X,) (tm — 4 m)§ 
In : 
+... + Oim — . $= dd iccecee (21), 
N 10m 
Subtract (21) from (17), noting that n»= S {n» m}, etc., we obtain* 
3...m 


(2,3, . we —o3 


999 


' 2 G7 
= 2 {Ns m(2,— 2 m&)°} +8 Sr — Dy (@, — %) (a — 7) 
2...m 12 t ox 


m R,*) No; 


g ) 
pious’ ~ {im -- Dim (2, — X) (Lm — Zm)} 


lm om ) 


. 7 Ee) 
= 8 {Me m(X— 2 m2) } +8 Sn — Dye (a2 — 7) (2 m&, — XZ) — ... 
2...m 2 { om ) 


Y ( O71 / - ; . 
+8; Nin is Din (Lin — a m) (s mi) — z)} 


m { m 


2...9 


’ - \o Oo} 
= 8 E m (XL, — 2m)" + Nem e bie (aq eg Zs) (om — 2) = eee 
n 3 : 


8 i 
“- E m (2... m%1 — %;) . m2, — X + dy, — (#2, —F%)—... 
o 


2...m 


» 


using (18) this equation becomes 


/ ea . / —  =—\% r 
(2,3, wil; 2 3, wh!) No! = S {Ne m (2 ma, — %) (s mt, — X,)} 


’ “ 


2...m 
* ~4 2) = — yr 
=S8S [Ne m (s..m& — X1)*} = S {Ns m(X,—2%) (2 mr — X,)} 
2...m 2...m 


* By an extension of the notation described at the beginning of this section S3,,, denotes a 
summation with regard to the variables x3,-a4, ... 2,3 41,9,..m is the frequency of a particular com- 
bination of the characters x,, x2, ... 2, While myo is the frequency of the combination 2, x2. 
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But Ss {No mM (X, kan Z,) G mr rasa X,)} 









/ 
= S dn m 
2...m ( 


7 a) tam a (2. Be es Oe oy — ..) 24, 
C. oC. 


3 


and = Noyosrs= S {nm (a, —%) (@2—- 2%) 
1 


oe 


= § {Ms m (#2 — %2) (2. m® — XZ) 
2...m 


with similar values for 7,3, 723, ete. 
*, the right hand side of (24) 
= — bya Pry — b,2 0? — b,3b,20371'05 - 
— oP) 3713 — Disdieo P2123 — O20? — b,3b,0;7 ... 


-_ ob,» (—Tp- Dro 7 DisT2s a Dis?'04 waa an Dim Tom) 
+ ob; (— 113 — bs — Dig 144 — +++) 
PURSE WOE PaO REN No ee (25). 


Each line in (25) is identically zero from the definition of the b’s and the properties 
of the determinant in (19). 

S {Ne m (2 m&, — X,)?} 

Hence sal’ —-. whi= Voto tte (26), 
oy 

so that the fundamental properties proved by Professor Pearson in connection 
with the correlation ratio , hold for the generalised H defined in this section. 
In particular equation (26) shows that a necessary and sufficient condition for 
linear regression in multiple correlation of m variables is that 


wll ,* = 39 m R;*. 


For in this case the mean value of any array of 2, will lie on the “best fitting” 
m dimensional plane. 


2,3,.. 


$4. The regression surface of z on wy being assumed of any particular type 
the constants in the equation may be determined (i) by the method of least 
squares, i.e. by making the sum of the squares of the deviations Z,,— (xy) a 
minimum, z= (a, y) being the regression surface, or (ii) by giving such values 
to the constants that the correlation between z and ¢(a, y) shall be a maximum. 
When ¢(z, y) is of the second degree the two methods lead to identical equations 
for the determination of the coefficients. 

The same equations are also obtained if the surface be “fitted” to the means 
by the method of moments. There is, however, a distinction to be observed. The 
equation z= (a, y) when the regression surface is of specified degree contains a 
definite number of constants and the first two methods will give exactly as many 
independent equations as there are constants to be determined. The method of 
moments will give as many equations as we please if sufficiently high moments 
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are used, including of course the equations given by the “least squares” method 
or maximum correlation method. Even without introducing high moments, when 
there are three variable characters new equations may be obtained by the method 
of moments, by combinations of characters which do not arise in the other methods. 
The method of moments is most convenient for our purpose, but we shall only 
employ those equations which can also be justified by (say) the method of least 
squares, 

For convenience let the origin be taken at the mean of the three characters so 
that Z=7=Z=0, let g,+,1.. denote 

SSS [Nayerty'2"} Pst, 
No,fo,'oz" o,°o,'o," 

With this notation r,, and gz, are identical; when z does not appear in the 
product, it is sufficient to write 9,6, 

The most reasonable next approximation to make when a linear function 
$(«,y) does not adequately represent the statistics is ¢(#, y)=a quadratic 
function of a, y. 


Let Lor 2o. Pe. aN a (28). 


oz Ty Gy GxFy Gy Gy 


Multiply (28) by n,,z, sum for all values of a, y, z and divide by V 


PT lin HOR | scar ces ces concer essskincaieeed (29). 
Multiply (28) in turn by ~ times =, cg ara = ‘, and sum as before, 
and we obtain 

Pay SO Dray + Odgty + OG + fdagt  -00-scccescevccveessvesenseres (30), 
yg = DF Af ay + Coley + Ofaty +S dys -.2-ccercrceccccerevececcesees (31), 
Qayz = drzy + A axty + bqxy: + COa2y2 + Cay + foxy Maes pocunenaed (32), 
Yatz = A+ Aqas + bqary + Cary + CGxt +S Qnty? cveceecceceeceeeeeees (33), 
Yyz = 1+ Agaye + dgys + Cony: + CQary2 FS Gy sscececseccereeceaces (34). 


Actual numerical fitting shows that in many cases e and f are small compared 
with c*. This is the case when the regression of z on « for a constant y and of 
z on y for a constant « is linear. We shall therefore confine ourselves in the 
present preliminary paper to the case where we may write 


Here for constant # or y the regression of z on y or of z on @ is linear. 


* Cf. Census of Scotland, 1911, Vol. ur. p, xuvit. where Mr G. Rae obtains by moments the 
regression of fertility on age of husband and wife. Let W=age of wife, H=age of husband, C=number 
of children in completed marriages. He finds 

C wy = 20°149498 — 0°555812W — 0°173804H — 0002846 W2 — 0:003494H? + 0°012675 WH. 
See also the paper by E. M. Elderton in the current part of this Journal, pp. 291—295. 
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Equations (29) to (32) become, when the regression surface is given by (35), 


PO Ss ica iidinisce cee ataen (36), 
Fey POF ECs ss cress enccintomberrsWensstecent (37), 
Figg OD Oy Flay. 5 ins tn enacindensgedsncenvvconieested (38), 
Yxyz = EN gy + AQapy + DGya + CQatry2 — seeeseseeeceeeeeceees (39), 


== CF gy + AQx2y + bgzy2 + CYz2y2 by (36). 
Solving these equations we obtain 





a - b 
| Very Qxty Taz ie Vary 1 Yaz 
1 Yay? yz Vay? Vey Vyz 
Yay? § Yaty2— Mary — Yaryz Qaryt — Pay Yxty Ynyz 
ig : = : ~ ..-004(40), 
l Vary Vaz ] Vey Qxry | 
| Vny 1 pe | Tay 1 Jay? 
) Qary Yay? Yaz | Qty Yay? Yay? — Try 


we have already denoted the partial regression coefficients by y, and y,' so that 


Vez — VyzT ay Vyz — Vaz ey 


= é : SS ee ¢ c . 

x= — a and ¥; a. (9) and (10) 

In addition let ee 5 ee hee (41), 
1-1 ny 

and gL eae Meee eR (42). 
1 aoe Tey 

After some reductions the determinants in (40) yield 
DED) Secccipndveceveqnen iitenaete ee (43), 
DOP OE cnc ccciinsciesdcpisishaed te ree (44), 


Orx2 + PT yz + Jaye 


C= meee UG aclu pone FaudettmCn eee 45), 

OQary + PYxy: + Yury — Tay ‘ 
also d=— Wiad) sees nn vcaes voile aaistinene enone hsuicgn toe submeeeee (36). 
Note that YsQary + Ys Gey = — (Oran + Pray) .-cececscsceescssecevees (46), 


and YsVex + Ys Tey = xy? (ef. eqn. 11). 


’ i = ) 
= S | Mayz (2 — Zry)*} 


& 7 eniti 9) on 
§5. Bydefinition (1-,,H/) Nop 


? 


.*. using (35) 


P 4 (Nays (2 ax by cay?) 
2 2) = : —d—— —#%-——+})} 
(1—,yH2)=S8 OT a sae ae a) 
=1- 2s | < (a i= by eo ) Maye 
oz 


VS 
+24 048 + Cqgey t+ Wdergy + 2abryy + 2acqgry + Vheqry:, 
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Ox Cy zy 
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= gy? = — are, — Wry, — Wry + C+ 0+ E+ Cqay 


= 0 (— Peg + O+ dryy + COx%y) 

+b (—tyz+b + argy + Gn) 

+ €(— Garyz + Array + AGary + DG xy: + CO xy) 

+d (d+ crzy) 

Fi a i lise 5 x cnk cbingarcenvad aes Uesinsesnensanyernan tat (48). 
The first four terms vanish by equs. (36)...(39), 

OS lg Es Og Pas | +0020 0 0 civdkarcarevines (49). 
If we now insert the values of a, b, c from (43)...(45) in (49) 
ayll 2 = (Ys + C8) ex + (Ys + Ch) Pay + CO xyz 
= Ys ex t+ Ys Tye + (Ore + Hr yz + Yay) 


p's (Ore 3 br yz rk Yxyz) a 


= gy he —, 
wc his uty + Jaye + Qary2 — Tay 





a Qay? + Qrary — 2Qa2y Vary21 ar = 
| gen ra}! x 7 xy Qary Vay mT ‘2 T° zy) 


1—71,, 


on mH? A RE ct {Gaye (l— Tay) — Jay (Tze? xy — ys) — Qaty (yz xy = 1xz)}? (350). 


It follows from (50) and (15) that 


ce Chek |) Le (51). 


2 
Yx2x? — Vey > a 
1— 7174, 


If we eliminate q,,, (which is a triple moment troublesome to calculate) 
between equations (45) and (49) we have 
ay? = Arex + Or yz + Cxyz 
= AN x + bry, + C (Gary — Tey) + ACGary + beqay: 
= Tex (Ys + cO) + Vy (ys 3 ch) + (Gary — 1 xy) 
+ (730 + €°8) Qary + (ys ¢ + CD) Gary 
= 2 [Quy — Try + Oqary + Pay: | + ¢ [Prez + PI zy + Y3Qaty + Ys Gy] 
+ Ys" 2x + Ys Vey 
= gy RZ + C [gary — Pay + Oqary + OGx,2] by (46) and (11), 
"2 "yy? — 2 y2 yl ay 
os re. aa? =¢ | gov — Pay — Yay + q a <i fora : 
xy 
ay if 2 Se. ayR,? 
_ Paty + Pray — 2Qey2 Gary Tay 


1 — ry, 


Hence c= 








Gary? — 1 ny 
This value of c? is positive by (51). 


Equation (52) shows that ,,H?=,,R? is a necessary condition for linear 
regression, which we have already proved in equation (26). 
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The regression surface of z on x, y is, with the values we have now obtained 
for the constants 
ZZ _Vez-VyVry © 2% 4 "ye TaeT ay Y — y 





Gs l—1y Oe l—-ry oy 
es J He — wy R? [Pry ary? — Gay V—- 2X 4 Very Yatty — Jaye y-¥ 
ie 2 a + Oren, — 20, . — 7, = 
2 Gay + Qnty — 2Qxry Qay2 xy U Ts avy Cx 1l—ry Cy 
Qaty? — Vay — os "oe 
=e ay 
(v@— 2) (4 -¥ 
+ it Fgh cssutens (53). 
TxTy : 


The terms in the first line give the ordinary regression plane. In most cases 
the regression does not differ widely from linearity so that ,,H?— ,,R, is small. 


§ 6. We must now get some idea of the relative magnitudes of Yay2y Yatys Ya2y? 
and moments of lower order. 
First with regard to q.2, which is equal to Pa , 
‘ : Oxo, 


S(Nayay) _ S(nz¥z%*) 


Gary = P F 
day No; cy Nao,2c, 


If the regression of y on z be linear 


Oy . 
(1. —! x Vey 


ps te 
- _ = = Nyy B:, 
No,7o, 





a) 
Qa2y = S 


so that g,», is zero if the regression is linear and the frequency of 2 symmetrical. 
In fact qx2, — 1x, V8, =0 is the same as Pearson’s criterion for linear regression given 
by €=0 (Skew Correlation and Non-linear Regression, p. 30, Eqn. (1xix)). 
We may obtain a good approximation for gm, by considering the regression of 
y on « to be parabolic. This is a natural assumption to make if the regression 
surfaces of # on z, y and y on 2, x be also of the hyperboloid type we are discussing 
for the regression of z on @, ¥. 
For with origin at mean we may write 
Bye fy gz. hyz 
wig IL, -. 
ox Sy Gz. TyGz 


Hence keeping y constant and summing for the 2’s 


Z, Y Qe, . hyz, 
_y = 6 ob ty op 9%y + Yry . 
ox Cy oz GyTz 
: r v, Nz — T*yz y? —— y } 
But 4 = Vy, 2 ter MA ae a =e 2 LY * 
oz Cy Vv Be — By —1 (Oy Cy 


* See Pearson: Jc, Eqn. (Ixv) (where Y,, is a misprint for X,), and fy”, 8; refer to the distribution 
of z. 


51-2 
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~¥ is a quadratic expression in y if we remember that h being of order 
ox 


V yell? — yaks? is of the same order as V yn? — ry2*. 


But the relation $2 (x? — xy) —==0 is satisfied when the regression of y on 
« is parabolic F. 


Here od. = B, = B, —1, €=e- Vay VB;, e= Yxty t+ 
ee duty — Vay VB, = Vaty? — Tiny VBg— By aL voeeeseeesecsceees (55). 
Similarly Gay — Tay V By = V yg? — Tiny V By — By —Laeeeeseeeeeeeeeees (56). 


The use of these approximations will save the direct calculation of qa, and 
Gz provided we can determine the signs to be attached to Vz?—T*xy and 
V2? —x,. This is often easily done by inspection of the regression curve whose 
equation is 


Vx x“ / aNy? — Try |e x“ 2 
-=<=f + —=- -— — 1}%. 
: Cx Vv B, + B, -1 ‘= VB, xv is 


Cy 
We can approximate to q,2,2 as follows||: 


ae ieee SS (Nzy @*y*) i S {n,a* (o*y, + (V2 —y))} 

dxty? No,2o,7 No,o,7 z 

where Y, is the value given by the regression straiyht line and o*y, x nz the 
second moment of the array of y’s for a given x, about the point Y,. 


S {n,a* (Y,—7)} 


But : = 8 (nz, 21 y,0,7)/0,7 
Noo? ices Sa 
= Be 1 vy 
y 9 9 
. S (Nz 2? a" y,) ; 
Thus qa? = No Sn — + Bay, 
a Vy 
oe S (n,y?o7y 
Similarly Yaty = =e xv) + By P xy; 
“ x y 


or, so far without approximation 


S (Nya o* Ya) + S ( Nyy? Oxy 
Qaty? = 


+ $ (RB, = B,’) 1 xy» 


2No,20,7 


* It is noteworthy that the hypothesis that regression of z on «, y although of 2nd degree is such 
that regression of z on x for a constant y is linear leads to the result that the total regression of z on « 
is parabolic. 

+ Pearson, l.c. p. 28, Eqn. (lxiii). 

t Pearson, l.c. Eqns. (li), (xlv) and (Ixiii). 

§ Pearson, l.c. Eqn. (Ixv). 

It can be found fairly directly by tabling to the squares of the variates, when we need a simple 
product moment. In a later part of this paper some comparisons of actual and approximate values for 
numerical cases will be found. 
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Now the mean values of o*y, and o*y, are known to be 
o/(1—7*,,) and o,7(1—?r,y), 
and the deviations from these are usually somewhat irregular. Rarely can we 
do anything better than assume them to vary with a slight linear variation from 
the mean. For example 
o*y, = oy (1 — 7% zy) (1 + Ax), 
where is small. In such a case 
S (nz @*0* yx) ; 
No,o,? =(1 —1ryy)(L+2V8,), 
or to a fair degree of approximation*, we may put 
: gary = 1 — 17 yy + $ (Ba + Br’) Pay, 
and thus write 


Quy — Tz = 1+ + (B.+ By’ — +) ey 
=1 + Wry + 4(B, —3 + BY = 3) Tey. 


The latter part of this expression vanishes if the frequency of the w and y variates 
be mesokurtic. It can of course be retained if desired but its product with 


V wy? — xR? will usually be of the second order. If we write 
$= 4(.—3+ By - 8), 
we find the approximate regression surface 


Z —Z Vz —VyzPry (@— 2%) | Vyz — Va2V ry (Y — Y) 


Cz l—fry Cx 1 — ry Cy 


+f saat (e-DO—D _,,) 
V 14+(0+ W) Pry Cx Cy o 
This equation (57) enables us to express approximately the multiple ,,H, in 
terms of the simple 7.2, 292, xy, yNx- 
§7. To obtain this connection between the multiple ,,H, and the simple ’s 
we may proceed as follows : 
fe pg Bo 


, origin at mean. 
o, Te Ce Cutty 


Hence keeping y constant and summing for the 2's 








en SY | TR TET (58), 
Cz Ox Cy Ox Cy 
But yn? a SSS ISteass (Zy 2 zy} = SSS (Nayz 2y') if z= 0, 
No? No? 
oui H= SSS {Ngyz (Zay — 2)*} a SSS {Nzyze7*xy} 
ay 4tz No? Nao? , 


* Ie. we are neglecting terms of the second order as AV Bi. 
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en eee Nayz (a (a a Ty) cy (x oe Zy) & + hy by 
ayH; yNz - 833 |" \ + )(2a+a = +2 


vyz x Gy y x oy 


4 Yere )| 
Cy Cx 
= 88 Sd (=) (a + 2) 2 + oY) + (a + 2) (2) |} 
& y y y v 
= ” {(« + 2} (a+ 2H) 8° = eal +8 + SS do “(a 4 =F 3 


=0+8\(a+ ey)’ (st9 = te. 3| 





Cy 7 a fh" 
Now S {ngya*} = S {(v@ —&y + Fy) nay} = ny (o,,7 + 0 + 4,7), 
x x 
‘ P cy \° "Oxy 2 n,) 
yH? —,nZ2=S (a =v 
ay 4tz yz : + oy! o,? V 


a le cy am Ny | 
~8{(0+ 3) ny 


JM 2 2acy al 
= (L— pnd) 8 (ae + U4 SY) 


y o,” 
gy lh — gta? = (1 — get) (SP 4+.) occ. cece see seceseed (59). 
Similarly ap thd — gt? = (1 = 92) (OE + A)... .cescceceseeses (60). 


Remembering that a=+;+0¢0, b= ig +c we get from (59) and (60) 


u H? -, m3 ote 2 
+ c? {ys — y38 — Od (yah — ys 9)}....-- (61). 
From the values of ys, y;', 8, @ in (9), (10), (41), (42) we obtain easily 


Ty2Qaty — Vx2Qay? 
19 'y2z42*y az Yay 
sh = d= Py 
1 — 14, 


ys 6 —7;0 = eis — Gay" 
sb — 739— Op (ys@ — ys 9) 


_ Va24ey — VyzVay? _ (r ("xy Yay? — Yat y) (7 ‘zy Vary — = Yay?) (Ty2Yary = xz Yn) 
l—r*,, (1 — ray) 


and c* is given by (52). Hence (61) may be written 


? 


(all? — a B) co — Taz" ay) (xy Jay — Yzty) mi ("xz — VyzT xy) (Tay Jay — uty) 
wy ++ 2 ay “vz | (1 Pai tay) (1 ae yNax) (1 aa ay) (1 oe 2y’) 


(Tx2Qaty fe VyzQay ) _ (7; Vary Yay? — ie Qa2 2y) (7, “ay Yae ‘y — Yay 2) (7 yz] ae a ms Vaz Yay? )) 
l1— Tey (l_-r 2) 
(Qraty + xy — 2Qay?Ga*y" xy) | 


1— ry 








(qaaye — Txy) — 
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_ (Tye = Vaz? xy) (Vey Yay — Try?) ye? _ (Vaz ~ TyeVay) (xy Fay? — Yaty) aN 


(1—ry) (1 — y%a°) (1 — r*yy)? (1 — y 22") 
4 Ca—  TyzT ay) (r Ss ~ Vx? "zy) (TyzQaty Yo Ve2Q ay? :) 
(l- _ Tey) 
— 2h? {(T yz — Vaz ay) (Vay Gary — Gay) _ (az —V yz ay) (Pry Yoye — Yo? ” smell 62). 
fT ( (1 — ray)? (1 — y 92”) (1 — 1 gy) (1 — ny’) = 


In general the square of any correlation ratio (ordinary or fa differs 
little from the square of the corresponding correlation coefficient; also we have 
seen in (55) that an approximate value for q,, is 


Very VB, = Von “= Ty VB, ~~ By —1. 

Now A, is itself in general small, so that without making any assumption as to 
the relative order of magnitude of 8, and ,n,?— 7*,, we may safely treat q., and 
Jay: a8 small quantities. 

Thus it appears that (62) is an equation in which all the terms involved are 
small, and a certain amount of care is required in deducing from it an approxi- 
mation to the value of ,,H7— .,R/?. 

We have 





1 1 ya — Tray 1 
= td pa —— + 
L—ym” L—ry (L-yn2)(l—-Ty) 1-12 ‘ | (63) 
l 1 aNy on Pe 1 ee gees , 
— = +__* oe th 
a Ny 1 — xy | 1— ay’) (1 —1r*zy) 1—- Tay 
yMz na 1 wg 1 } 
i= y Nx Te (; = T xy 24 &) + (yet 2 ‘a) es — Pay i gi) 
PSs EE Oe eee ae chon et (64), 
1 — 7, 


where A, = Tay (ya — May) pie rey and ».= (yne! = Tey) (ye! — Tay) 


a 1—,m2)(1-r* .) — T xy (1 — ya? ) (1 — ry) 


while iz = = Re Tk, BR ee ere gee «ee (64) 
—aiily — ¢ ay 


where 2,’, X.. may be obtained from \,, A. by an interchange of # and y in the 
suffixes. 





: 1 1 om 
Finally - : 5 Pepe we (65), 
2 Pay? + Fraty — 2Qay2FaryTry § Yxty2— Tay 
eR ae ee i 
ry 
where 
Ko = Qa + Paty — 2a ary xy 





Pay + Paty — 2ayQary? 2) 
a2 
1—7 2, 


The suffixes of &,, &,’, ,, Xe, Ay’, As’ and «, denote the order of smallness of 
these terms. 


(1 — r*2y) (qa2y2 — T xy) (qury — Pry — 
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We make the corresponding substitutions in equation (62), including the value 
Tye + Ten — 2 ee yeas 
1— 1, 
accurate formula for ,,H?—,R/: 


@ ‘ (tus —Vxz Ty) (Tey a2 Bs Jay) : ) 
bz 2 ! yet an 
(ay H? — aR) | (1 —1*y,)? (; = ay fi 


for zy? on the right-hand side and obtain the following 


a a Ty 2 


re {Set = Ty2Goy? (Tye Vary — M22 Yay) (Pay Yay — Gary) (Vay ey — Uv?) | 


“ez — YyzV x " ay? ~ Yar, 1 , 
=, "yzTay) (Vay Jay? — 4: ?( — BZ £’) 


1 — rp (L — 1gy)° ) 


1 )| 
x (= 3 = ae 
Qaty? — Tay 


= (Tye aa Tx2V xy) (Tay Qaty — Yay) ( ra -+rA+ >a) 
y 





a cs wy) - rr, 
(Taz — TyzT xy) (Tey Yay? — Vary) ( ee , ; 
aes a ee > r Ae 
(1 a 1 xy) 1- 1 ny tA HAs 
4 Crary — Meee) (ee — Mya ay) (Tye — Vea) 
(1 — rz) 
_ ye + Mae — 2822 y2V ay |e = a2¥ xy) Yay Jory — Ye) ( * g ) 
(1 — r*zy) (1 — ry)? i=" 
(xz Pity Tyz xy) (Try Vey? - uty) ( 1 ; 
(1 — rzy)? 1— ry +6 Te (66). 


The right-hand side of (66) apparently contains terms of the first order, while 
on the left the lowest order occurring is the second. But the coefficient of g2, in 
the first order terms is 


1 


. ° . . 2 . ° ° a2 ° . e ° . _ » ) 
d-r,,? {(Tye — V2 ny) Vay? zy + ("x2 — NyzT xy) 120 + yz (Vaz — TyzV ay) (Tyz— Vx21 ay )} 
vy 


° . 9 ° . poe Ll (72 2 Dr » ° 
* a-r y {yz — Vx2z% xy) Tay + (1x2 — TyzT xy)} (7 yet? we — 20 ye ny xy)s 
ae 


and vanishes identically. Similarly the coefficient of g,, is zero and thus 
equation (66) is a relation between terms of second and higher orders. 
A first approximation then may be obtained by equating second order terms. 
This gives 
(ay H? — xR?) |e — Vaz ay) (Vay Yary = Yay?) — (xe = N yz? ny) xy Yay? — Yaty) 
rt ! = ey)? 
— "a2 daty ~ Vz Joy? | 
(1 — Ty) (Gx2y2 _ Ty) 
— ("ye = Vaz" ay) (Vay Gry — Tey) E _ Tye + Me = Beat yet ay E 
=? 2 2 1 
(1 — 17)? 
ion (Taz — Tyz"ay) ("xy Jxyt — Yxty) [a ft yz + Pe — 2x27 yzV ry Fs ‘| 
1 i a a Se 1 
L 


LT 


1l— Tey 





(i- 72)? 














The coefficient of ,,H?— 
1 


rv’ - 


2 
1 ry 


(67) we replace 1 —,,7,2 and 1 


of the 


direct calculation 


denoted above by 3, 3 


3 and 4. To these latter we 

A good approximation for 
we may use 

Ya yy . 

Yat y? — rm ay 

We saw in § 

Vary = 

Vay? = 

approximately. 


Vey Qaty — Yay? 


— Yay? "yz 


Qax2y Vez 


and 


Vary Vay? — Yary 
Yay? yz -4q aty Vaz 


Biometrika x 








i - — 7, e [Qaey xz — Yxy27 yz] 


. 4 1 yz + P22 — Zien ysl ay Fa 
1 : — i= 
1— May 
Similarly 


Yyz + "a2 —~ 2i'xz Vyzl ay 


We shall still be correct to second order terms if when using (68) 


generalised correlation 


The coefficients involved are the ordinary coefficients of linear 


Yat y? and 


ny V(Bs =))G omy — 7 =u) — V( (A. — =) (ae - 2 
Naz V(B2 — 1) ( 


_ Tay N (B. — 1) (y M2" - 
Vyz V (BR. a VD) (yn _ ay) 
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ay it? on the left reduces to 


1— 717, 
1-— i, 
Qaty? — Vay 


yNz — Py ee £ (TyzT xy a xz)" 
= 1 5 aes 
1— xy 1— Mxy 
ee N27 — Ty (ye — ray) (new ee ay (68) 
=. — ——= ——]} oeeeee 38). 
1— xy i— ya i May / 


2 2 
_2z — Vx 


22 
at ay 


“a2” xy 


=) (69). 


1-r,y 


. : 
—. Ny — VT ay (° 
; 
1— xQy 


— Ny? occurring in 





denominators by 1 —7%,,; 
that 
Qx24;2 1 

(xy H. bay y R,7) defy gras” 

: Qaty2 — 

Vey Vary — Yay? yz ~~ VazV ay " e - (Y, 2 ey — "22 ? e 
: dan 4 (y92 — zy) aed ( — (y Nx — P zy) 
QatyVaz — Yay2V yz 1 — rny , Fey xy : 
Vay Yay? — Yary : Tne — ry2Tay - NZ — 7" 2g) = (a= “se) (x Ny? i ie .) 
Qay?V yz — Ya2yV xz 1- Yay oe 
Bae (70). 
§ 8. This result is of importance, as it shows that the heavy labour of the 


calculation of four simple correlation ratios. 


may approximate by the methods of § 


27*,y 


§ 6. 
Jatye — 1 
L +7 gy 


May 


$ (8, + BR, - 6) + 2) rx 
=735 ‘wy + 4 (B2+ B: — 6) rxy 


S 6 (equation 55) that 
Yay N /B, + V My? _ 
Vay V By + V yn _ 
8, and £B,’, which are 


be very small compared with ny’ — 


Mey VB. — By —1 ’ 
xy V Bo — By — 1, 


Py and 412? — Txy 80 that 





) (aM? — a y) Vyz V(By = 1) (yn — ” zy) 


ny) = V( 8-1) (xny" =. 1 yy) 
= V (Bo —1I) (a My" 05 


Ixy ) 


or 
to 


) and (69) in 


ratio canbe replaced by the 


regression 
and expressions involving product moments of orders 


is » . Ifgreater accuracy be needed 


zero in normal correlation, will in general 
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In the important case B,=8,/=0 and ,n.=2ny=Tx these approximations 
fail, and so does the process by which (70) was obtained as @ and ¢ vanish and 
(61) is indeterminate. 


We must then fall back on equation (59) 
ay? — yn? = (a + &) (1 — yz’) 
=(¥;? + c?) (1 —7*,,) since @=0 and yn, = Try 
Coy H? — zy R?) Bes Ty) 


= ys? (1 — 1) + = - re) + Ky), 
Qaty? — Mxy 
To 2 1 —7 0 2 ° a 
or (ay MH? — ay RY) (1 we ts | =yNz — ay R? + Ys" (l- Troy) 
Qa? el 7 ay 


neglecting 3rd order terms. 
The right-hand side reduces to yn — r*,,, so that 
> - Jaty2 — Tray . i 
wHe—_Ri= (fen 2) Re ad ced dic eae (71) 
xy? — 1 
_ Bray 
1+ ry 


of course in these circumstances (60) would lead to the value 


(,n2 — 7°,,) approximately ; 


Soe fed lew ty rE Rae APM oan (72), 
1+ 1 ey (om: Ma) si 


showing that if 8, = 8,’ =0 and if (,.”,?— 7x) — (yn? — T2y) is of higher order than 
the first then (,,97— 1°.2) — (ynZ — 7°,y) is also of higher order than the first. 
We shall now seek relations between the six correlation ratios of three 
“hyperbolic” variates. 
From (59) and (60) we get, on eliminating ,,H/, 
yNe — oN? =P — B+ (yn? — 29,7! + a yna? — Pen? 
= 987 — Ys 1 Ys yD? — Ys22My? + 2 (Ys yne? — Ys bx y’) 
+ (8 — B+ yna? — amy? + Pyne — $*x0,') 
= (ys? — ys") (1 — 1*xy) + ¥3* (ya? — Txy) — Ys” (ey? — xy) 


— 2c (yO - y'$) (1- T xy) 
+ {2ey;0 (y Mx" er: Ty) 4 2cy; b (xy = 1 2y) 


+O (P — G+ yn? — oy + Pyne? — b*2y’)}- 
The terms in the second line are second order terms. Neglecting these and 
noting that 
(ys? = "Y3") (i aia 1 py) = ae — T22 and (y30 = "3 >) @ = Ty) = VyzYay? — TazQary>s 
we obtain the following equation for c: 
(y Ns — ry) - (xn? sia 145) = x3" (y Na? — 1) + 3" (¢ Ny a Ty) 


= 2c (Tx2Qaty = TyzQay2) cocces 
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pty? “ 
Now (2H? — mR; ») fen = 2c*r*,, if we neglect second order terms. If 
Qaury? — Tay 


we use the value of ¢ given by {73) in (70), and adopt the notation ,U, for 
2Nz — 127, we have 
1 xy [yU2 — 2U 2 — yey se + Y¥2*2U yP 
=2 (TxzQaty - TyzQay2) {(?xy Quy ai Yay?) Ys. (y U, ae "Ys" y U;) 
— (Tey Jay? — ary) Ys (a Uz — 2" Uy)}...-+ (74). 
Let us write ,y- for ,U,—,",U, and yy, for ,U,—y2,Uz, then (74) becomes 


(2X2 — yXz)° May = 2 (azQa2y — Nyz ary?) {Ys (Tey Yary — Yay) yXz — Ys (Tay Jaye — Yary) Xz} 


(75) is a relation between second order terms and it is sufficient to use 
equation (55) for q.2, with 8, replaced by zero and B, by 3, so that 
Qaty = V2, U, ; 
day? = V2, U., 
(xXz = yXz) 1 ny = 4 (Tee V,U, — Tyz V,Uz) {ys yXz (Try v,U, U, Dad vy Uz) 
= YsxXz (Tay V,Uz— V,Uy)} seeeee (76). 
This identity between 27, yz, 2%, and ,7, is symmetrical in 2, y. Two more 
such identities may be obtained by interchanging the letters a, y, 2 in cyclic order*, 
There are therefore three identities between the six correlation ratios : 
yNxs «My yNz. zy az. Za 
I have not so far succeeded in reducing them to simpler forms, although possibly 
such exist. In special cases simplifications reszlt. These are illustrated in the 
following section. 


§9. We defined y;, y;' the regression coefficients of z on a, y by the equations 


Vaz — VyzV ay Vyz — VaeTay 
Ys = $$ 9 ne ecccvecceceevere (9), De (10). 


1— ey 1 — ay 


Let us now introduce the remaining regression coefficients 


y _ Tye — Vee yz 9! = 2 ryan ye 
_ i= : 
1—ry, . 1-7” ‘yz (77) 
evercccecccesccces ( 
= Vyz — Vaz" ay + Vay — VyzV zx 
2 2 ? gS Be ee 
1 — 7x2 1-9, 


It will simplify the algebra if we use d’, py’, v*, NX", w?, v’® for yUz, Uy, 2U2, Ue, 
2Uy, yUz respectively and P, Q, RB, P’, Y, RB’ for yyw, sxXy: aXe a a, 
respectively so that 

P=iz-— yy et Q = pe = yo? X?, R=,pr-— se) 
P=2r2-— ye, Q om p? aes eV", R =p?- ytd? J 


* We are supposing here that the regression surfaces of x on y, z and of y on z, x are also hyper- 
boloids of type similar to (35). 


52—2 
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The three identities connecting the six simple 7’s become in this notation 
(R- RV rx = A172 w’ — Ty2) {ys R’ (Tay —r)—- yak (Txyr — pw’) . (79), 
(P — P’YP ry, = 4 (Tye! — Teo) [yy P’ (rye — w) — HP (Tyee — v')} --(80), 
(Q— QP rz =4(TaW — Pry v) {ye Y (Tan —v) — ¥2Q (ez v — VY} «--(81). 
(i) We can deduce from (79) that if the correlations of both x and y on z be 
linear and equal, then the correlation ratio of # on y and y on @, i.e. gny and ynz 
are equal. Thus in biparental correlation, if the regression of the child on each 


parent be linear, then the correlation ratio of the father on the mother is equal to 
that of the mother on the father. 


For under the conditions stated 
az = yNz = Vex = Vey 


‘ 
Hence y; =; = —-~— and (79) becomes 


1+ Tay 
yet [2 — eB rey = bys (we! — A) {= 39? (Tey! — A) + ys oH? (Tay — HY}, 
or ya (X— pA + WP Pay = Arye (A MY {rey rp’ — 2 Dp! — wh, 
or (X— pw’)? | nt (V+ p= 4 (ray rp! — Ap! — VV — H)| =0, 


which reduces to 
(A— py? {H Vny + 2P + (2 — Pry) wi P+ yl (2ray + 3) (Tay + aa -— 
(Tey +1) (Tay + 2) 

But rz, is numerically <1. .*. the factor in curved brackets is positive. Hence 
AN=yp’, Le. y Uz=x U,,. “. ya = aNy- 

(ii) An interesting deduction from the identities (79)—(81) is the following: 
“Tf any four of the six regression lines that occur in the mutual variation of three 
variables are linear, so are the other two.” 

We have to prove that if any four of the six quantities X, w, v, d’, w’, v’ vanish, 
then the remaining two vanish as well. 


First let X= p=v=2 =0. 


(79) gives Vay? (ys we? + v? YP = Aree e’ (ys vPey e! — Ys"Ys B"}, 
(80) M8 tye = — Arey vy”, 
(81) Bb 7re7 = 0. 
From (80) atv [den Tye + 127 ye!] = 0. 
But dey, Ty + Ys 2 yg = (ue Mel ay = ata i « 
\ l — ry 
y =0, 
and pu’ =0, 


and these satisfy (79). 


There are three cases of this type. 
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The case \’= w’ = »’ =X =0 is proved in the same way. There are three cases 


of this type as well. 


Next take the three cases of type 


y=p=0, 
V=p'= 0. 
Equations (79)—(81) become 
(79) R=R,, ie. v2 =v, 
) Ny ry? = Arzyv’ (yyy? v? (-— v’)}, 
me) yh? = 4(— Ppyv) [— yo y2v? (— v)}, 
(80) leads to (y1'1'y2 + A'ey'y:) i290" = 0. 


We have already shown that the first factor is positive, 
vy =0, 
and hence v=0, 
and these values satisfy (81). 
The three cases of type X= = p' =v’ = 0 lead to 
(79) v7, = 0, 
(80) r'ty2,, = 0, 









(81) (ry? = Yo" r?) Nye = 4 (yz = VryV) {— ye YoY? (ng = v) + a2 2A? (1x20 — vy}, 


whence \’ = y= 0. 
There remain the three cases of type 
A=yp=0, 
p =v =0. 
Here (79) is satisfied identically. 
(80) becomes (A? = y Py?) 1 yz = — 4yzp [yy (A? — 2") (— w)}, 
(81) (uw? — yo? A?) 1p = yz! {— Yo (wu? — yo A) (— DI, 
which reduce to 
(AN? — 2p?) [12 yeh? — (921 ye + 4 Tez) B= 9, 
(pe? — ya?) (9% 2? — (Yo? 12x + 4727 yz) X2} = 0. 
The only common solution of these equations is 
=p=0, 
We have thus accounted for all the fifteen possible cases. 


(iii) Three regression curves linear. 


In six cases out of the possible 20 cases the linearity of three only of the 


regression curves involves the linearity of the remaining three. 
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Let X = p’ = 0 and either v or v' = 0. 
It follows from (79) that R = R’, i.e. v= v’. 
.. both v and v’ are zero. We have now four linear regression curves, .’. all 
six are linear. 
Let p=v' =0 and p’'=0. 
Since w=2r'=0, *. P=FP’ or X=N, so that 
P=, Q=-y7,", Et = »*. 
P=’, YV=p?-y, RB =—-yFn. 
(79) becomes (v* + y:°A?)? 12 xy = Ar yzysd? (Tzyv? — Ys'¥s 0°), 
(80) becomes 
(ya? v? = Ya? M7)? 1 yz = 4 (Tyzd — TryV) Yao ¥* (YoY — Yo) + Vaz (Y2'V — Yar)]- 
The first reduces to 


jes (Taz i Yy2V zy) Tay — 2rye 


. + Try} =90 
1 nae Poy ary ’ 


and the second to 
(av? — yo A) Pag + beyorye’ (Tyzd — Tryv)? v? = 0. 
Hence either X=v=0 or there must be @ very special relation between 
Cans Cone Tar 
If instead of u’=0 we take v= 0 we get similar results, i.e. in general the 
vanishing p, v’, w’ or pw, v’, v involves that of X, v, XV’ or A, pw’, NV. 
This accounts for six more cases. 
There are eight left. Of these six are typified by 
pe=v=0=y or p’=r"=0=—4h, 
and lead to the same conclusions. 
The remaining two are 
A=p=v=0 or N=p'=r'=0. 
The first supposition, A= ~=v=0 gives 
P=-y'v", Q=-y,r°, R= y37u", 
P’ =", Q = p®, RP’ =y", 
leading to 
(79) (v"? ys 2)? Pye = Age ys {Pryv'® — y's Hh, 
(80) (A? + yy 2v?) ry. = 47 zy vy {Pyzd? — my Vv’? | 
(81) (wi? ys? A?) 1 py = Arye d 40! [1z2p? — Yared}, 


which give \’=y’ =v’=0 or a very special condition to be satisfied by the 
correlation coefficients ryy, Tyz, zx. 
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We may conclude then that in general the linearity of any three of the sia 


regression lines involves that of the remaining three. 


(iv) If the regression surface of z on a, y reduces to a plane, the regression 
curves of # on y and y on # reduce to straight lines. 


We have as in §7 


zy x, l . ry K 
L=d+a—+ J 46 | TET RR aS .«+.(58). 
Cz ox Cy TxTy 
a, y ya — Tray { y” y ) 
But ety t,/ ¥ Y jv — Vf —1}, 
ox - Oy = B. ais B, —1 lo,? , Bs Cy ) 


¢ 


z, a 272, 2 pe es yNa? — Tn, 
Y—dia a/ % y+ * b+ arxy + (ANB, + Cc) J ge aral 


y? ( + Buy? — Txy)” / nt — Py ) 
+ *33(CTay + ¢ a +aa/ ; j 
o,? | yi B.—8,—1 \ P:— B:—1f 
+ terms of higher order. 
Now the regression of z on y for a constant wx is linear. Therefore the 
coefficient of y* is zero. To first order terms we may put 8,=0 and £,=3, 
and thus 


J yNa — Tay 
Cray + a / Oy 0 
aera , sind ay _ 
Sunilarly Clr + O > = 0; 
But ¢ vanishes when ,,,H,? = ,, RZ by (52). 
Hence if ee e— a » ft,*. 
it follows that ay = ya = Tay: 


We thus see that if the three generalised correlation ratios ,,H,, yzHz, »2Hy 
are equal to ,,R,, Az, 2R, respectively, the six correlation ratios zy,, yMz, 
22z, Mx» My» yz reduce to the corresponding correlation coefficients ryy, T22, Tyz 
and that the “linearity” of the three regression surfaces involves the linearity of 
the six regression lines. 
























MISCELLANEA. 


I. On Spurious Values of Intra-class Correlation Coefficients arising 
from Disorderly Differentiation within the Classes. 


By J. ARTHUR HARRIS, Pu.D. Carnegie Institution of Washington, U.S.A. 


WHEN the constants of the w and y characters of the population in 7, are quite indistirguish- 
able symmetrical tables* may be used, but not otherwise. 

Primarily and for the most part, however, the use of symmetrical tables has been restricted 
to cases in which the degree of interdependence between the measures of all possible pairst drawn 
from a considerable series of associated individuals—in short to. intra-class correlations |—is 
sought. 


The dangers of spurious correlation due to the artificial symmetry of the surface is then much 
greater§. Pearson|| long ago pointed out that when intra-class differentiation exists, for example, 
because of age in the case of characters determined upon the members of a fraternity, or of posi- 
tion on the axis in the case of serial organs, the values of 7 may be to some extent spurious. 

In the cases considered by Pearson differentiation is an orderly phenomenon, i.e. the magni- 
tudes under consideration increase or decrease with age, position on the axis, or some other 
extrinsic characteristic with such regularity that the relationship can be expressed by an 
equation which may be used in correcting the raw values of 1. 

In other cases, the problem is not so simple. Differentiation within the class may exist, but 
it may be difficult or impossible to arrange the individual measurements by any character outside 
of themselves to obtain the constants necessary for determining the true correlations from the 
spurious values deduced from the tables. 


Illustration I. The correlation between yields of wheat in variety testing. 


In variety testing, the experimenter seeks (or should seek), among other things, to determine 
the correlation between yields of varieties in different years. If this correlation be 0 (and regres- 
sion be linear) it is clear that the yield of a variety in one year furnishes no basis for prediction 


* R. Pearl, Biometrika, Vol. v. pp. 249—297, 1907; H. S. Jennings, Journ. Exp. Zool. Vol. x1. 
pp. 1—134, 1911 ; J. Arthur Harris, Biometrika, Vol. vu. pp. 325—328, 1910. 

+ K. Pearson and others, Phil. Trans., A, Vol. cxcvu, pp. 285—379, 1901; K. Pearson and 
A. Barrington, Eugenics Laboratory Memoirs, No. V, 1909. 

t Biometrika, Vol. 1x. pp. 446—472, 1913. 

§ With only one pair of measures the probability of spurious correlation is, in cautious work, very 
slight, for the possibility of differentiation can be easily tested by the critical comparison of the physical 
constants, 


|| Pearson, K., “On Homotyposis in Homologous but Differentiated Organs.” Roy. Soc. Proc. 
Vol, Lxxi. pp. 288—313, 1903. 
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concerning its yield in any subsequent year. If, on the other hand, the correlation be high, 
prediction from a few years’ test may be made with great probability of certainty. 


Given a measure of the “performance? of a series of varieties during a number of years it 
would at first seem quite allowable to form symmetrical tables or to use the intra-class formulae 
of a former paper* to determine the intra-varietal correlation, and to regard this as a satisfactory 
measure of the differentiation of the varieties and of the average prediction value of a year’s test. 
Such is, however, not the case, for while there may be no orderly change in yield throughout the 
period under consideration, the individual years differ greatly in their average yield for all the 
varieties. The influence of this “disorderly differentiation” upon r is admirably shown by 
A. D. Hall’s+ table of the yield in bushels of wheat in the Rothamsted experiments. 


Let b=yield in bushels per acre of any one of m varieties in any one of n years, ¥;, y2 be the 
“first” and the “second” years of a symmetrical intra-varietal correlation surface, v1, 7, be the 
“first” and “second” varieties of a symmetrical intra-annual correlation surface. Then Toy, by 

2 


will be a (spurious) measure of the (persistent) differentiation of varieties, T,,b,.» & (Spurious) 
a 
measure of the differentiation (in the yield of all the varieties) of years. Applying formulae 


(v)—{ix) of Biometrika, Vol. 1x. p. 450, to these data, I find 
S[n (n—1)]=2128, 
S[(n—1) 3 (b')]=83122'5, S[(n—1) 3 (b2)]=3483626-4, 
Ss (b')P=3610204-57,  —-S [3 (b)] = 37082013, 


6=39°0613, o,2=111°257328, 
Toy, by. = — 032. 


The result is obviously spurious, for mere inspection of the entries in the table shows that 
some varieties regularly give heavier yields than others. The source of the spurious value is to 
be seen in the fact that an intra-class coefficient has been calculated from a symmetrical surface 
formed from classes (varieties) represented by a series of yields differentiated by annual variations 
in the growing conditions. By correcting for this source of differentiation by expressing each 
yield as a deviation from the mean yield of all the varieties for the particular year, i.e. b” =b—6,, 
where the bar denotes a mean and the subscript y that it is for all the yields of a year, I have 
found t 
To''y Wy = 266. 

Measuring the differentiation of years in terms of intra-annual correlation (intra-class correla- 
tion in which each class is defined by the year and its individuals are the yields of the different 
varieties grown), I find from Hall’s table 


S[m (m— 1)]= 4440, 
S[(m—1) 3 (b)]=174129°2, S[(m—1) > (b?)]=7317531°92, 
S[ (b') P=7586436-21, S[= (b'2)] =370820'13, 
b=39°2183, o;2=110°017719, 


Toy, by = ‘791. 


Since the varieties have been shown to be differentiated, this result must also be spurious. 
Let b’=b—b, where the v indicates that the mean denoted by the bar is for the yield of the 


* Biometrika, Vol. 1x. pp. 446—472, 1913. 

+ Hall, A. D., The Book of the Rothamsted Experiments, p. 66, 1905. 

t Science, N. 8. Vol. xxxvi. pp. 318—320, 1912. Probably a better method of dealing with euch 
cases will sometime be found. So far I have not succeeded. 
Biometrika x 53 
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variety for all the years it was grown. Correcting for the influence of the differentiation of 
varieties in this way I have found* 
om ° "7 
19" 837. 


Thus season is a far more important factor than variety in determining an individual yield. 


Illustration II. Influence of Personal Equation upon the Correlation between the Grades 
assigned to the Same Paper by a Series of Instructors. 


Stripped of the verbiage in which it has been clothed in discussions among pedagogues, one 
of the chief problems concerning the reliability of the grades assigned in examinations resolves 
itself unto the statistical question: What is the correlation between the grades assigned to the 
same paper by different instructors ? 


Let g be the grade assigned to any one of m papers by any one of x instructors, let 7, 7 be 
the “first” and “second” instructor (of a symmetrical intra-class table) passing judgment 
upon a paper, ~;, po the “first” and “second” paper graded by the same instructor. Then 
from Table I of D. Starch+ I deduce, by the intra-class formulae (v)—(lx) of Biometrika, Vol. 1x. 
p- 450, 


= 659, 071. 


"Yi1 Vin "Op, Ip. 


By using the deviation method as illustrated above, [ have found 
r, = °732, = 386. 


0" 9"in T9'y,9"'Do 


TABLE IL. 


Grades of Papers Assigned by Various Instructors. 




















Instructors. 
hee | ’ a a 

1 | 2 3 | yj 7) | 6 ‘4 7.9 9 10 =(9) | 
| 
1 s5 | s6 | ss | 85 | 75 | 80 | 88 | 87 | 8 | 87 846 | 
ae 77 | 80 | 87 | 80 | 62 | 82 | 82 | 87 85 | 87 809 | 

bie ae 74 | 78 | 78 | 7 | 69 | 84 | 91 | 83 | 79 | 80 791 

E 5 6 | 6 | 62 | 20 | 296 | 60 | 55 | 68 | 55 | 50 526 
o 5 6s | s2 | 78 | 82 | 64 | 88 | 8 | 86 | 78 | 80 791 | 
a 6 94 | 87 | 93 | 87 | 88 | 77 | 89 | 88 | 88 | 89 875 | 

o 7 88 90 | 95 87 79 85 96 91 87 89 887 

8 80 | 84 | 73 79 | 72 | 88 | 8 | 91 | 77 | 76 800 

9 7 70 | 68 | 50 | 44 | 6 | 7% | 81 | 79 | 79 681 

| 10 93 | 92 | 85 | 92 | 81 | 83 | 92 | 89 | 84 | 8 876 

i } 

. | | } | 

2(g) | 794 | 814 | 807 | 737 | 655 | 787 | 838 | 851 | 797 | 802 | 7882 
—— : u = = uJ | m Sve 4 


Both of these results, in which an attempt was made to correct for the personal equation of the 
instructors in determining the correlation between the estimates of different instructors on the 
same paper, or to correct for the differences in merit of the papers in testing the individuality of 
the instructors, are higher than the raw values given above, which are clearly spurious. Similar 
results t are obtained from Jacoby’s astronomical grades §. 

* Science, loc. cit. 

+ Science, N. 8. Vol. xxxvimr. p. 630, 1913. 

+ Personally, I can attach little pedagogical significance to series as short as those of either Starch 
or Jacoby. They serve here as illustrations of method merely because I know of no more extensive 
series. 

§ Science, N. S. Vol. xxx. p. 819, 1910. 
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The essentials of this note may be summarized as follows: 


In using Intra-class coefficients care must be taken to guard against spurious values arising 
through differentiation among the individuals of the class. 

Besides the orderly differentiation (due to age of individuals, position of organs on axis, etc.) 
for which Pearson has determined corrective formulae in terms of correlation coefficients, a 
disorderly differentiation for which such corrective formulae have not as yet been found some- 
times obtains. Illustrations of such cases are here given. 

Probably the empirical methods used here in correcting for this disorderly differentiation 
should be replaced by formulae with a sounder theoretical foundation. This I have not as yet 
been able to do. 

The purpose of this note will have been served if it directs attention to a source of danger 
which may sometimes be encountered in the use of serviceable formulae, and indicates a method 
by which in the absence of more perfect methods practical results may be secured. 


Cop Spring Harpor, N.Y. 
February 3, 1914. 


II. On an Extension of the Method of Correlation by Grades 
or Ranks. 


By KARL PEARSON, F.R.S. 


In a memoir published in 1907* I have shown how, on the hypothesis of normal distribution, 
the true correlation of variates * may be ascertained from the correlation p of grades. If 
9; and gz be the two grades, v, and v2 the corresponding ranks, 7 and y the corresponding variates 
with means # and ¥, and standard-deviations a; and o2, while 


, 1 a2) = Orr 2 
Wo tp (5- +5) 


= e ~1-r? \o) eyo. § ae 


Qroyoq JT — 72 


is the normal frequency surface of the variates, then 


I 


7_ = 2 9 y 
Nn=sNn =925 Fo =%q° tz, 


=i 


yj=v=9+h, 1=¥),=4 (N+1), 


o*, = 0%, = ps (N?—1). 


Further I showed in the memoir just cited that 


* «©On Further Methods of Determining Correlation,” Drapers’ Company Research Memoirs (Dulau 
and Co.), pp. 11, 12. 
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where a convenient method of finding p was by the formula 


p= Bios 6S n= 92)" 


v4 


6S (v,— ve) 
or again by eS vn we ; 


The problem has recently occurred of dealing with data where : 
(i) One variate is given quantitatively, the other variate is given by ranks. 
For example, place in school-class has to be considered in relation to marks in examination, 


or the rank in a teacher’s general appreciation has to be considered in relation to marks in 
examination. 


(ii) One variate is given by broad categories, the other by ranks. 
For example, five or six categories of general intelligence are given as the basis of the 
’ 5 
teacher’s classification of intelligence, and this has to be considered with regard to rank in, 
say, class or examination, possibly with regard to a special subject. 
We require in both cases to deduce from the data the true variate correlation. 


Case (i). Let « be the character measured by its grade, y the character given quantitatively. 
Then with the notation above, if p’ equal the correlation of grade and of variate, 7 the corre- 
lation of the two variates : 

1__ Poy 
abess WN. ’ 
4 TygTy 
where 
rts: fae x, : 
Po.w= 2(y—¥) (1-9) dedy, 
J -@Y 


—-@ 


dp, y “i / ae x cs OO 
ov — —7) lady. 
Re: Bete ILE Yr Gp Ory 
Integrating by parts after putting ¥y=0 and writing 
dz dz 
—_— “hs * 
dr "'"? dedy’ 


Mpg, y | +2 | +e di, dz 
ov , - dxdy. 
3s a 710 T dy‘ vay 


Integrating again by parts: 
5 i} d 


- ~ rad 
to (to NH -3-5 
| ——*s “oy zdady 


” 


a, Orz'y’ y ) 
en re ee , 3 
1-1 1-1} da’ dy 





eS en ae 


—o 27 J] _78 








ot O N? 1 
Von J1—7? =p 
(l-—ry? { 
Hence dp! _ ad Poy \ 
dr dr No» oo, 





* Phil. Trans. A., Vol. 195, p. 25. 
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Thus since p’ vanishes with 7, 


[2 
o 
—f, 


r=a/ — 


If we used ranks instead of grades, we should have oy, for a, and consequently 


r Rf | at 
=, ~~ 
' x J/N2-1 
r= “ve p’ = 10233 p’, 


IN2—] 
ea r= 7 1 1.0233 p”. 


Thus finally 


It will be clear from this that the correlation p’ between rank and quantitative variate can 
never be “ perfect,” for it cannot exceed the value ‘9772, otherwise the correlation r would exceed 
unity. It will be seen that for practical purposes 7 is very close to p’, but still from the 
theoretical standpoint, it is not without interest to discover that the correlation between 
ranks and a quantitative variate can never be perfect. For example, it is impossible to have 
perfect correlation between place in class and examination test, even if the boys were in the 
same order in'class and examination. The defect, however, will be very slight. 

Case (ii). Let the subscript C refer to any “broad” class and let » be found from either 
of the formulae 


19 12 ¥ {- —\9 
or n?*= WUN? a” {Ne (Ve-V)*}, 
the first applying to grades and the second to ranks; then 


Tas 
r= 1°0233 Ja S {Ne (Ge -9), 


; _ rrr 
i =10233 ./ 2 sin. 66-9)%, 


according as grades or ranks are used. In actual practice the values of 7’ or 7” should be 
correct for number of classes and for “broad” categories. See Biometrika, Vol. vut. p. 256 and 
Vol. rx. p. 118. 


Numerical illustrations will be provided later. 


III. Correction of a Misstatement by Mr Major Greenwood, Junior. 


In a recent paper by Mr Major Greenwood and Mrs Frances Wood “On changes in the 
Recorded Mortality from Cancer and their Possible Interpretation*” occur the following words : 
“The case is evidently analogous to that studied by Professor Karl Pearson in his pamphlet, 
The Fight against Tuberculosis and the Death-rate from Phthisis (Dulau and Co.). Professor 
Pearson published three diagrams: (a) the general death-rate of England and Wales; (b) the 
phthisis death-rate ; (c) the ratio of phthisis deaths to all deaths. The original figures seem to 
have been the crude rate for males and females separately from 1835 onwards.” The “evident 
analogy” with what appears to me the wholly fallacious treatment of the authors in their paper 
above cited I do not now stay to discuss, but I wish to draw attention to the words: “The 


Royal Society of -Medicine, Proceedings, Vol. vu. 
March 27, 1914. 


Section of Epidemiology, pp. 79—170. 














Miscellanea 419 


original figures seem to hive been the crude rate for males and females separately from 1835 
onwards.” Why the writer of these words should have assumed them without any inquiry 
of me, or any examination of the values of the crude death-rates (which are accessible to every- 
body) to be “crude death-rates,” I do not know, but they illustrate his readiness to form a 
biased judgment when his feelings are stirred by unfavourable criticism. As a matter of fact 
the rates were standardised rates reduced to the population of 1901, and most kindly 
provided at my special request by the General Register Office. It is of interest to observe that 
Dr Weinberg of Stuttgart—recently made precisely the same charge as Mr Major Greenwood 
with the same over-hasty assumption that the reality must be the desired, if undemonstrated, 
error*, With the German as with other foes, it is well to leave ample opportunity for their 
assuming you to be foolish ; their assumption may lead them to run against hard reality. 


a. F. 


IV. Note on Reproductive Selection. 
By DAVID HERON, D.Sc. 


The fact that in the case of mant fifty per cent. of one generation comes from twenty-five 
per cent. of the preceding one was first noted by Professor Karl Pearson in the Chances of 
Death (Vol. 1. p. 80) and in dealing more fully with this important generalisation in the Ground- 
work of Eugenics, p. 27, he said: “It is very difficult from any English statistics to determine 
how many adults never marry. No information on this point is asked in the death schedule for 
males; it is asked but imperfectly answered in the case of the schedule for females.” In 
a footnote he adds: “The Registrar-General informs me that the record of civil condition 
in the case of female deaths is worthless and that no useful return can be made from it.” 
He found that in the Argentine and in Scotland 60 per cent. died unmarried, in the United 
States 51 per cent., and from the last two English Censuses and the Annual Reports 48 per 
cent., and added “This indirect method of reaching the result is, however, not very satisfactory. 
We may, I think, conclude in round numbers that 40 per cent. of the population dies before it 
reaches the age of 21 and that probably another 20 per cent. are never married.” On this 
assumption Professor Pearson proceeds to show that “about 12 per cent. of all the individuals 
born in the last generation provide half the next generation.” 

Some data published in Bulletin of Population and Vital Statistics No. 30 for the Common- 
wealth of Australia (Tables 48 and 84 a and b) prove that the assumptions made lie very close to 
the facts. The data are shown in the following table which gives the conjugal condition and 
issue of the males and females who died in Australia in 1912. From this we find that half the 
total number of children came from 3337 of the parents (all those who had at least 9 children 
and part of those who had each 8 children). It thus appears that of the males 17,404 out 
of 30,285=57°5°/, died unmarried while half the total offspring came from 25-9 °/, of those who 
married and 11°0°/, of the whole number of males, so that approximately three-fifths of the 
males born die unmarried and one-half of one generation comes from one-quarter of the married 
population or from one-ninth of all the males born in the preceding generation. The diagram 
gives a graphical illustration of the argument. 

In exactly the same way we find that nearly one-half of the females born in Australia die 
unmarried and that one-half of one generation comes from one-quarter of the married and from 
one-seventh of all the females born in the preceding generation. 


* Archiv fiir Rassen- und Gesellschafts-Biologie, 1x. Jahrgang, 8. 87. Leipzig, 1912. 
+ It has also been dealt with in various mammals. See the Groundwork of Eugenics, Eugenics 
Lecture Series 1 (Dulau and Co.), p. 29. 
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Conjugal | Children in || Deaths of Total Deaths of Total 
Condition | Each Family Males Children Females | Children 





Single 0 17404 








Married 1422 
1036 
1098 
1127 
1147 
1070 
1058 
1040 

992 

819 

801 

473 

394 

196 

109 

50 
27 


CoA Nite So M@e OS 














as 
Totals 30285 69089 | 62824 | 








Diagram to illustrate the fact that three-fifths of those born die unmarried and that one-ninth of 
one generation produce one-half of the next. (Deduced from records of males.) 
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